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DRUG TARGETS IN CANDIDA ALBICANS 



Field of the Invention 

The present invention is concerned with the 
5 identification of genes or functional fragments 

thereof from Candida, albicans which are critical for 
growth and cell division and which genes may be used 
as selective drug targets to treat Candida albicans 
associated infections. Novel nucleic acid sequences 
10 from Candida albicans are also provided and which 

encode the polypeptides, which are critical for growth 
of Candida albicans , 

Background of the Invention 

15 Opportunistic infections in immunocompromised 

hosts represent an increasingly common cause of 
mortality and morbidity, Candida species are among 
the most commonly identified fungal pathogens 
associated with such opportunistic infections, with 

20 Candida albicans being the most common species. Such 
fungal infections are thus problematical in, for 
example, AIDS populations in addition to normal 
healthy women where Candida albicans yeasts represent 
the most common cause of vulvovaginitis. 

25 Although compounds do exist for treating such 

disorders, such as, amphotericin, these drugs are 
generally limited in their treatment because of their 
toxicity and side effects. Therefore, there exists a 
need for new compounds, which may be used to treat 

30 Candida associated infections in addition to 

compounds, which are selective in their action against 
Candida albicans . 

Classical approaches for identifying anti-fungal 
compounds have relied almost exclusively on inhibition 

35 of fungal or yeast growth as an endpoint . Libraries 
of natural products, semi -synthetic , or synthetic 
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chemicals are screened for their ability to kill or 
arrest growth of the target pathogen or a related 
nonpathogenic model organism. These tests are 
cumbersome and provide no information about a 
5 compounds mechanism of action. The promising lead 
compounds that emerge from such screens must then be 
tested for possible host- toxicity and detailed 
mechanism of action studies must subsequently be 
conducted to identify the affected molecular target. 

10 

Summary of the Invention 

The present inventors have now identified a range 
of nucleic acid sequences from Candida albicans , which 
15 encode polypeptides, which are critical for its 

survival and growth. These sequences represent novel 
targets which can be incorporated into an assay to 
selectively identify compounds capable of inhibiting 
expression of such polypeptides and their potential 
20- use in alleviat-ing diseases or conditions assoeiated- 
with Candida albicans infection. 

In a first aspect of the invention, the invention 
relates to a nucleic acid molecule encoding a 
polypeptide which is critical for survival and growth 
25 of the yeast Candida albicans and which nucleic acid 
molecule is selected from the group consisting of SEQ 
ID NO: 1 to SEQ ID NO : 9. More preferably where 
nucleic acid molecule is selected from the group 
consisting of SEQ ID NO: 1 to SEQ ID NO: 3 and still 
30 more preferably consisting of SEQ ID NO : 1 and SEQ ID 
N0:2 . 

In a preferred embodiment of this aspect of the 
invention, the nucleic acid molecule is either RNA or 
DNA, and in one embodiment, cDNA. The invention also 
35 relates to a first nucleic acid molecule capable of 

hybridising to a second nucleic acid molecule encoding 
a polypeptide which is critical for survival and 
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growth of the yeast Candida albicans and which second 
nucleic acid molecule is selected from the group 
consisting of SEQ ID NO: 1 to SEQ ID NO : 9 under high 
stringency conditions. In yet another embodiment, the 
5 invention relates to an antisense molecule comprising 
a first nucleic acid molecule capable of hybridising 
to a second nucleic acid molecule encoding a 
polypeptide which is critical for survival and growth 
of the yeast Candida albicans wherein the second 

10 nucleic acid molecule is selected from the group 
consisting, of SEQ ID NO; 1 to SEQ ID NO: 9. 

The invention also relates to cells containing a 
nucleic acid molecule encoding a polypeptide which is 
critical for survival and growth of the yeast Candida 

15 albicans and which nucleic acid molecule is selected 
from the group consisting of SEQ ID NO: 1 to SEQ ID 
NO: 9, wherein said cells are bacterial or eukaryotic. 
The invention further relates to polypeptide encoded 
by a nucleic acid molecule which is critical for 

20 survival and growth of the yeast Candida albicans, and 

which nucleic acid molecule is selected from the group 
consisting of SEQ ID NO: 1 to SEQ ID NO: 9 and to 
polypeptide having any of amino acid sequences 
selected from the group consisting of SEQ ID NO: 10 to 

25 SEQ ID NO: 16 . 

The invention also relates to a recombinant DNA ' 
construct comprising a nucleic acid molecule encoding 
a polypeptide which is critical for survival and 
growth of the yeast Candida albicans and which nucleic 

30 acid molecule is selected from the group consisting of 
SEQ ID NO: 1 to SEQ ID NO: 9, wherein the nucleic acid 
molecule is DNA. Preferably the DNA is an expression 
vector and preferably the expression vector includes 
an inducible promoter and/or a reporter molecule. The 

35 invention also relates to recombinant DNA constructs 
comprising a first nucleic acid molecule wherein the 
first nucleic acid molecule or a molecule 
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complementary to the first nucleic acid molecule is 
capable of hybridising to a second nucleic acid 
molecule encoding a polypeptide which is critical for 
survival and growth of the yeast Candida albicans and 
5 which second nucleic acid molecule is selected from 
the group consisting of SEQ ID NO: 1 to SEQ ID NO: 9 
wherein the first nucleic acid molecule is inserted in 
the antisense orientation. 

The invention also relates to cells containing a 

10 recombinant DNA construct comprising a nucleic acid 

molecule encoding a polypeptide which is critical for 
survival and growth of the yeast Candida albicans and 
which nucleic acid molecule is selected from the group 
consisting of SEQ ID NO : 1 to SEQ ID NO : 9, wherein 

15 said cells are bacterial or eukaryotic. 

In another aspect of this invention, the nucleic acid 
molecules or polypeptides of this invention are 
provided in a pharmaceut ically acceptable carrier, 
diluent or excipient . 
-20 - The invention further relates to a. method. for _ 

treating a Candida alJbi cans-associated disease 
comprising the step of: administering a composition of 
matter comprising an antisense nucleic acid molecule 
capable of binding to a Candida alJbi cans-originating 

25 nucleic acid sequence selected from the group 
consisting of SEQ ID NO : 1 to SEQ ID NO : 9 . 

In another aspect of this invention, the 
invention relates to a Candida albicans cell 
comprising an induced mutation in the DNA sequence 

30 encoding a polypeptide which is critical for survival 
and growth of the yeast Candida albicans wherein the 
polypeptide is encoded by a nucleic acid molecule is 
selected from the group consisting of SEQ ID NO: 1 to 
SEQ ID NO: 9. 

35 The invention also relates to a method of 

identifying compounds which selectively modulate 
expression or functionality of polypeptides or 
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metabolic pathways in which these polypeptides are 
involved and which polypeptides are crucial for growth 
and survival of Candida albicans , which method 
comprises: contacting a compound to be tested with one 
or more Candida albicans cells having a mutation in a 
nucleic acid molecule, the nucleic acid molecule 
encoding a polypeptide which is critical for survival 
and growth of the yeast Candida albicans and which 
nucleic acid molecule is selected from the group 
consisting of SEQ ID NO: 1 to SEQ ID NO: 9 wherein the 
mutation results in overexpression or underexpression 
of said polypeptide in addition to contacting one or 
more wild type Candida albicans cells with said 
compound; and monitoring the growth and/or activity of 
said mutated cell compared to said wild type; wherein 
differential growth or activity of said one or more 
mutated Candida cells is indicative of selective 
action of said compound on a polypeptide or another 
polypeptide in the same or a parallel pathway. The 
-invention further relates to compounds identified by 
this method. The compounds, can be provided in a 
pharmaceutical composition with a pharmaceut ically 
acceptable carrier, diluent or excipient therefor. 

The invention also relates to a method of 
identifying DNA sequences from a cell or organism 
which DNA encodes polypeptides which are critical for 
growth or survival of said cell or organism, which 
method comprises: preparing a cDNA or genomic library 
from said cell or organism in a suitable expression 
vector which vector is such that it can either 
integrate into the genome in said cell or that it 
permits transcription of antisense RNA from the 
nucleotide sequences in said cDNA or genomic library; 
selecting transf ormants exhibiting impaired growth and 
determining the nucleotide sequence of the cDNA or 
genomic sequence from the library included in the 
vector from said transf ormant In a preferred aspect 
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of this invention, the cell or organism is a yeast or 
filamentous fungus and more preferably the cell or 
organism is any of Saccharomyces cere\risia.e , 
Schizosaccharomyces pombe or Candida albicans . 
5 The invention further relates to antibody capable 

of specifically binding to a polypeptide of this 
invention. The invention also relates to 
oligonucleotides comprising a fragment of from 10 to 
120 contiguous nucleotides of a nucleic acid molecule 

10 encoding a polypeptide which is critical for survival 
and growth of the yeast Candida albicans and which 
nucleic acid molecule is selected from the group 
consisting of SEQ ID NO: 1 to SEQ ID NO: 9. 
Preferably the oligonucleotide comprises a fragment of 

15 from 10 to 50 contiguous nucleotides. 

Brief Description of the Figures 

Figure 1 is a diagrammatic representation of 

plasmid pGALlPNiST- 1 . 
20 Figure 2 is a diagrammatic representation of 

plasmid pGALlPSiST- 1 . 

Figure 3 is an exemplary growth curve of a 

Candida albicans strain demonstrating antisense- 

induced reduction in growth in response to 
25 intracellular antisense expressing specifically 

targeting one of the nucleic acid sequences of this 

invention . 

Figure 4 is a representative growth curve of a 
Candida albicans knock out strain as compared to wild 
30 type Candida albicans , 

Detailed Description of the Invention 

According to a first aspect of the invention 
there is provided a nucleic acid molecule encoding a 
35 polypeptide which is critical for survival and growth 
of the yeast Candida albicans and which nucleic acid 
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molecule comprises any of the sequences of nucleotides 
illustrated in any of Sequence ID Nos . 1 to 9 . 

Whilst the molecules defined herein have been 
established as being critical for growth and 
5 metabolism of Candida albicans, for some of the 

molecules no apparent functionality has been assigned 
by virtue of the fact that no functionally related 
sequences in other prokaryotic or eukaryotic organism 
can be found in respective databases. Thus, 

10 advantageously these sequences may be species specific 
in which case they may be used be used as selective 
targets for treatment of diseases mediated by Candida 
Albicans infection. Thus, in one aspect of the 
invention the nucleic acid molecules preferably 

15 comprise the sequences identified in sequence ID Nos. 
1, 4, 5 to 9. 

In another aspect of the invention the sequences 
have been arranged functionally and of nucleotides 
illustrated in Sequence ID Nos. 2 or 3 are preferred 

20 and even more preferably in Sequence ID No. 2 and 
fragments or derivatives of said nucleic acid 
molecules . 

Letters utilised in the sequences according to 
the invention which are not recognisable as letters of 
25 the genetic code signify a position in the nucleic 

acid sequence where one or more of bases A, G, C or T 
can occupy the nucleotide position. Representative 
letters used to identify the range of bases which can 
be used are as follows: 

30 



35 



M: 


A 


or 


C 


R: 


A 


or 


G 


W: 


A 


or 


T 


S : 


C 


or 


G 


Y: 


C 


or 


T 


K: 


G 


or 


T 


V: 


A 


or 


C or G 
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A or C or T 
A or G or T 
C or G or T 
G or A or T or C 



In one embodiment of each of the above-identified 
aspects of the invention the nucleic acid may comprise 
a mRNA molecule or alternatively a DNA and preferably 
a cDNA molecule. 

10 Also provided by the present invention is a 

nucleic acid molecule capable of hybridising to the 
nucleic acid molecules illustrated in any of Figures 1 
to 9 under high stringency conditions such as 
antisense molecule and which conditions are generally 

15 known to those of skill in the art. 

Stringency of hybridisation as used herein refers 
to conditions under which polynucleic acids are 
stable. The stability of hybrids is reflected in the 
melting temperature (Tm) of the hybrids. Tm can be 

20 approximated by- the formula : - 

81 .5°C+16.6 (Log^o [Na^] +0.41 {%G&C) -600L/L 

wherein L is the length of the hybrids in nucleotides. 

25 Tm decreases approximately by 1-1. 5°C with every 1% 
decrease in sequence homology. 

The term "stringency" refers to the hybridisation 
conditions wherein a single-stranded nucleic acid 
joins with a complementary strand when the purine or 

30 pyramidine bases therein pair with their corresponding 
base by hydrogen bonding. High stringency conditions 
favour homologous base pairing whereas low stringency 
conditions disfavour non- homo logons base pairing. 
"Low stringency" conditions comprise, for 

35 example, a temperature of about 3 7°C or less, a 

formamide concentration of less than about 50%, and a 
moderate to low salt (SSC) concentration; or, 
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alternatively, a temperature of about 50°C or less, 
and a moderate to high salt (SSPE) concentration, for 
example IM NaCl . 

"High stringency" conditions comprise, for 
5 example, a temperature of about 42''C or less, a 

formamide concentration of less than about 20%, and a 
low salt (SSC) concentration; or, alternatively, a 
temperature of about 65°C, or less, and a low salt 
(SSPE) concentration. For example, high stringency 
10 conditions comprise hybridization in 0.5 M NaHPO^, 7% 
sodium dodecyl sulfate (SDS) , 1 mM EDTA at 65°C 
(Ausubel, P.M. et al. Current Protocols in Molecular 
Biology;, Vol. I, 1989; Green Inc-. New York, at 
2.10.3) . 

15 "SSC" comprises a hybridization and wash 

solution. A stock 20X SSC solution contains 3M sodium 
chloride, 0 . 3M sodium citrate, pH 7 . 0 

"SSPE" comprises a hybridization and wash 
solution. A IX SSPE solution contains 180 mM NaCl, 

20 9mM NasHPO^ and 1 mM EDTA, pH 7.4. 

The nucleic acid capable of hybridising to 
nucleic acid molecules according to the invention will 
generally be at least 70%, preferably at least 80 or 
90% and more preferably at least 95% homologous to the 

25 nucleotide sequences illustrated in any of Figures 1 
to 9 . 

The DNA molecules according to the invention may, 
advantageously, be included in a suitable expression . 
vector to express polypeptides encoded therefrom in a 

30 suitable host which are critical for growth and 
survival of Candida albicans , 

An expression vector according to the invention 
includes a vector having a nucleic acid according to 
the invention operably linked to regulatory sequences, 

35 such as promoter regions, that are capable of 

effecting expression of said DNA fragments. The term 
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"operably linked" refers to a juxtaposition wherein 
the components described are in a relationship 
permitting them to function in their. intended manner. 
Such vectors may be transformed into a suitable host 
5 cell to provide for expression of a polypeptide 
according to the invention. Thus, in a further 
aspect, the invention provides a process for preparing 
polypeptides according to the invention which 
comprises cultivating a host cell, transformed, 
10 transfected or infected with an expression vector as 
described above under conditions to provide for 
expression by the vector of a coding sequence encoding 
the polypeptides, and recovering the expressed 
polypeptides. 

15 The vectors may be, for example, plasmid, virus 

or phage vectors provided with an origin of 
replication, optionally a promoter for the expression 
of said nucleotide and optionally a regulator of the 
promoter. The vectors may contain one or more 

20 selectable markers,- such as, for example, ampicillin 
resistance. 

Regulatory elements required for expression 
include promoter sequences to bind RNA polymerase and 
transcription initiation sequences for ribosome 

25 binding. For example, a bacterial expression vector 
may include a promoter such as the lac promoter and 
for translation initiation the Shine -Dalgarno sequence 
and the start codon AUG. Similarly, a eukaryotic 
expression vector may include a he,terologous or 

30 homologous promoter for RNA polymerase II, a 

downstream polyadenylation signal, the start codon 
AUG, and a termination codon for detachment of the 
ribosome. Such vectors may be obtained commercially 
or assembled from the sequences described by methods 

35 well known in the art. 

Polynucleotides according to the invention may be 
inserted into the vectors described in an antisense 
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orientation in order to provide for the production of 
antisense RNA. Antisense RNA or other antisense 
nucleic acids may be produced by synthetic means. 
In accordance with the present invention, a 
5 defined nucleic acid includes not only the identical 
nucleic acid but also any minor base variations 
including in particular, substitutions in bases which 
result in a synonymous codon (a different codon 
specifying the same amino acid residue) due to the 

10 degenerate code. The term ''nucleic acid sequence" 

also includes the complementary sequence to any single 
stranded sequence given regarding base variations. 

The present invention also comprises within its 
scope proteins or polypeptides expressed by the 

15 nucleic acid molecules according to the invention or a 
functional equivalent, derivative or bioprecursor 
thereof . 

The present invention also advantageously 
provides nucleic acid sequences of at least 

20 approx-imately 10 - cont-iguous nucleotides- of a nucleic 

acid according to the invention and preferably from 10 
to approximately 120 nucleotides. In another aspect 
of the invention, nucleotide acid sequences are 
provided from 10 to 50 nucleotides. These sequences 

25 may, advantageously be used as probes or primers to 

initiate replication, or the like. Such nucleic acid 
sequences may be produced according to techniques well 
known in the art, such as by recombinant or synthetic 
means. They may also be used in diagnostic kits or 

30 the like for detecting the presence of a nucleic acid 
according to the invention. These tests generally 
comprise contacting the probe with the sample under 
hybridising conditions and detecting for the presence 
of any duplex or triplex formation between the probe 

35 and any nucleic acid in the sample. 

According to the present invention, these probes 
may be anchored to a solid support. Preferably, they 
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are present on an array so that multiple probes can 
simultaneously hybridise to a single biological 
sample. The probes can be spotted onto the array or 
synthesized in situ on the array. See Lockhart et al . , 
5 Nature Biotechnology, Vol. 14, December 1996, 

"Expression monitoring by hybridization to high- 
density oligonucleotide arrays" . A single array can 
contain more than up to more than a million different 
probes in discrete locations. 

10 Advantageously, the nucleic acid sequences, 

according to the invention may be produced using such 
recombinant or synthetic means, such as for example 
using PGR cloning mechanisms which generally involve 
making a pair of primers, which may be between 

15 approximately 10 to 12 0 nucleotides to a region of the 
gene which is desired to be cloned, bringing the 
primers into contact with mRNA, cDNA, or genomic DNA 
from a cell, performing a polymerase chain reaction 
under conditions which bring about amplification of 
— _20 - the- desired region-,- isolating the -amplified -region or 
fragment and recovering the amplified DNA. Generally, 
such techniques as defined herein are well known in 
the art, such as described in Sambrook et ai 
(Molecular Cloning: a Laboratory Manual, 1989) . 

25 The nucleic acids or oligonucleotides according 

to the invention may carry a revealing label. 
Suitable labels include radioisotopes such as ^^P or 
^^S, enzyme labels or other protein labels such as 
biotin or fluorescent markers. Such labels may be 

30 added to the nucleic acids or oligonucleotides of the 
invention and may be detected using known techniques 
per se. 

The polypeptide or protein according to the 
invention includes all possible amino acid variants 
35 encoded by the nucleic acid molecule according to the 
invention including a polypeptide encoded by said 
molecule and having conservative amino acid changes. 
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Polypeptides according to the invention further 
include variants of such sequences, including 
naturally occurring allelic variants which are 
substantially homologous to said polypeptides. In 
5 this context, substantial homology is regarded as a 
sequence which has at least 70%, preferably 80 or 90% 
amino acid homology with the polypeptides encoded by 
the nucleic acid molecules according to the invention. 
Nucleic acids and polypeptides which are 

10 particularly preferred are those comprising the 

sequences of nucleotides provided in SEQ ID NOS : 1-3 
and polypeptides provided in SEQ ID NOS: 10-12. 
However, a particularly preferred nucleic acid 
comprises the sequences of nucleotides provided as SEQ 

15 ID NOS: 2 and/or 3, and their corresponding amino acid 
sequences identified as SEQ ID NOS: 11 and 12. 

With reference to the nucleic acids of this 
invention and the protein encoded thereby, amino acid 
sequence SEQ ID NO: 10 is the translation of SEQ ID 
--20- -N0 :1; SEQ -ID NO : 1 is the translat ion of - SEQ ID NO : 2 ; 

SEQ ID NO: 12 is the translation of SEQ ID NO : 3 ; SEQ ID 
NO: 16 is the translation of SEQ ID NO : 4 ; SEQ ID NO: 13 
is the translation of SEQ ID NO : 5 ; SEQ ID NO: 14 is the 
translation of SEQ ID NO : 6 and SEQ ID NO: 15 is the 

25 translation of SEQ ID NO : 7 . 

Nucleotide sequences according to the invention 
are particularly advantageous as selective therapeutic 
targets for treating Candida albicans associated 
infections. For example, an antisense nucleic acid 

30 capable of binding to the nucleic acid sequence 

illustrated in any of SEQ ID NOS: 1-9 may be used to 
selectively inhibit expression of the corresponding 
polypeptides, leading to impaired growth of the 
Candida albicans with reductions of associated 

35 illnesses or diseases (see Figure 5) . 

The nucleic acid molecule or the polypeptide 
according to the invention may be used as a 
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medicament, or in the preparation of a medicament, for 
treating diseases or conditions associated with 
Candida, albicans infection. 

Advantageously, the nucleic acid molecule or the 
polypeptide according to the invention may be provided 
in a pharmaceutical composition together with a 
pharmaceutically acceptable carrier, diluent or 
excipient therefor . 

The present invention is further directed to 
inhibiting expression of nucleic acids according to 
the invention in vivo by the use of antisense 
technology. Antisense technology can be used to 
control gene expression through triple-helix formation 
of antisense DNA or RNA, both of which methods are 
based on binding of a polynucleotide to DNA or RNA. 
For example, the 5' coding portion or the mature 
protein sequence, which encodes for the protein of the 
present invention, is used to design an antisense RNA 
oligonucleotide of from 10 to 50 base pairs in length. 
A DNA oligonucleotide is designed- to be complementary- 
to a region of the gene involved in transcription 
(triple-helix - see Lee et al . Nucl . Acids Res., 
6:3073 (1979); Cooney et al . , Science, 241:456 (1988); 
and Dervan et al . , Science, 251: 1360 (1991), thereby 
preventing transcription and the production of the 
corresponding protein. The antisense RNA 
oligonucleotide hybridises to the mRNA in vivo and 
blocks translation of an mRNA molecule into the 
corresponding protein (antisense - Okano, J.^ 
Neurochem., 56:560 (1991); Oligodeoxynucleot ides as 
Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). 

. Antibodies to the protein or polypeptide of the 
present invention may, advantageously, be prepared by 
techniques which are known in the art. For example, 
polyclonal antibodies may be prepared by inoculating a 
host animal, such as a mouse, with the polypeptide 
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according to the invention or an epitope thereof and 
recovering immune serum. Monoclonal antibodies may be 
prepared according to known techniques such as 
described by Kohler R. and Milstein Nature (1975) 

5 256, 495-497. 

Antibodies according to the invention may also be 
used in a method of detecting for the presence of a 
polypeptide according to the invention, which method 
comprises reacting the antibody with a sample and 

10 identifying any protein bound to said antibody. A kit 
may also be provided for performing said method which 
comprises an antibody according to the invention and 
means for reacting the antibody with said sample. 

Proteins which interact with the polypeptide of 

15 the invention may be identified by investigating 
protein-protein interactions using the two-hybrid 
vector system first proposed by Chien et al , (1991) . 

This technique is based on functional 
reconstitution in vivo of a transcription factor which 
-20 act i-vat-es a reporter gene . More part icularly -the - - 

technique comprises providing an appropriate host cell 
with a DNA construct comprising a reporter gene under 
the control of a promoter regulated by a transcription 
factor having a DNA binding domain and an activating 

25 domain, expressing in the host cell a first hybrid DNA 
sequence encoding a first fusion of a fragment or all 
of a nucleic acid sequence according to the invention 
and either said DNA binding domain or said activating 
domain of the transcription factor, expressing in the 

30 host at least one second hybrid DNA sequence, such as 
a library or the like, encoding putative binding 
proteins to be investigated together with the DNA 
binding or activating domain of the transcription 
factor which is not incorporated in the first fusion; 

35 detecting any binding of the proteins to be 

investigated with a protein according to the invention 
by detecting for the presence of any reporter gene 
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product in the host cell; optionally isolating second 
hybrid DNA sequences encoding the binding protein. 

An example of such a technique utilises the GAL4 
protein in yeast. GAL4 is a transcriptional activator 
5 of galactose metabolism in yeast and has a separate 
domain for binding to activators upstream of the 
galactose metabolising genes as well as a protein 
binding domain. Nucleotide vectors may be 
constructed, one of which comprises the nucleotide 

10 residues encoding the DMA binding domain of GAL4 . 

These binding domain residues may be fused to a known 
protein encoding sequence, such as for example the 
nucleic acids according to the invention. The other 
vector comprises the residues encoding the protein 

15 binding domain of GAL4 . These residues are fused to 
residues encoding a test protein. Any interaction 
between polypeptides encoded by the nucleic acid 
according to the invention and the protein to be 
tested leads to transcriptional activation of a 

-20 -reporter mol-ecu-le - in a -GAL4 -transcript ion -deficient ' 
yeast cell into which the vectors have been 
transformed. Preferably, a reporter molecule such as 
iS-galactosidase is activated upon restoration of 
transcription of the yeast galactose metabolism genes. 

25 Further provided by the present invention is one 

or more Candida, albicans cells comprising an induced 
mutation in the DNA sequence encoding the polypeptide 
according to the invention. 

A further aspect of the invention provides a 

30 method of identifying compounds which selectively 
inhibit or interfere with the expression, the 
functionality of polypeptides expressed from the 
nucleotides sequences illustrated in any of SEQ ID 
NOS : 1-9, or the metabolic pathways in which these 

35 polypeptides are involved and which are critical for 
growth and survival of Candida albicans , which method 
comprises (a) contacting a compound to be tested with 
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one or more Candida, albicans cells having a mutation 
in a nucleic acid molecule according to the invention 
which mutation results in overexpression or 
underexpression of said polypeptides in addition to 
5 one or more wild type Candida cells, (b) monitoring 
the growth and/or activity of said mutated cell 
compared to said wild type wherein differential growth 
or activity of said one or more mutated Candida cells 
provides an indication of selective action of said 

10 compound on said polypeptide or another polypeptide in 
the same or a parallel pathway. 

Compounds identifiable or identified using the 
method according to the invention, may advantageously 
be used as a medicament, or in the preparation of a 

15 medicament to treat diseases or conditions associated 
with Candida albicans infection. These compounds may 
also advantageously be included in a pharmaceutical 
composition together with a pharmaceutically 
acceptable carrier, diluent or excipient therefor. 

20 A further aspect of -the invention provides- a 

method of identifying DNA sequences from a cell or 
organism which DNA encodes polypeptides which are 
critical for growth or survival, which method 
comprises (a) preparing a cDNA or genomic library from 

25 said cell or organism in a suitable expression vector 
which vector is such that it can either integrate into 
the genome in said cell or that it permits 
transcription of antisense RNA from the nucleotide 
sequences in said cDNA or genomic library, (b) 

30 selecting transf ormants exhibiting impaired growth and 
determining the nucleotide sequence of the cDNA or 
genomic sequence from the library included in the 
vector from said transf ormant . Preferably, the cell 
or organism may be any yeast or filamentous fungus, 

35 such as, for example, Saccharornyces cGrevisiae , 
Schizosaccharomyces pombe or Candida albicans . 

A further aspect of the invention provides a 
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pharmaceutical composition comprising any of a 
compound, an antisense molecule or an antibody 
according to the invention together with a 
pharmaceutically acceptable carrier, diluent or 
5 excipient therefor. 

The antisense molecules or indeed the compounds 
identified as agonists or antagonists of the nucleic 
acids or polypeptides according to the invention may 
be used in the form of a pharmaceutical composition, 

10 which may be prepared according to procedures well 
known in the art. Preferred compositions include a 
pharmaceutically acceptable vehicle or diluent or 
excipient, such as for example, a physiological saline 
solution. Other pharmaceutically acceptable carriers 

15 including other non-toxic salts, sterile water or the 
like may also be used. A suitable buffer may also be 
present allowing the compositions to be lyophilized 
and stored in sterile conditions prior to 
reconst itution by the addition of sterile water for 

20 subsequent administration . - - Incorporation of the - 

polypeptides of the invention into a solid or semi- 
solid biologically compatible matrix may be carried 
out which can be implanted into tissues requiring 
treatment . 

25 The carrier can also contain other 

pharmaceutically acceptable excipients for modifying 
other conditions such as pH, osmolarity, viscosity, 
sterility, lipophilicity , solubility or the like. 
Pharmaceutically acceptable excipients which permit 

30 sustained or delayed release following administration 
may also be included. 

The polypeptides, the nucleic acid molecules or 
compounds according to the invention may be 
administered orally. In this embodiment they may be 

35 encapsulated and combined with suitable carriers in 

solid dosage forms which would be well known to those 
skilled in the art. 
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As would be well known to those of skill in the 
art, the specific dosage regime may be calculated 
according to the body surface area of the patient or 
the volume of body space to be occupied, dependent 
5 upon the particular route of administration to be 
used. The amount of the composition actually 
administered will, however, be determined by a medical 
practitioner, based on the circumstances pertaining to 
the disorder to be treated, such as the severity of 
10 the symptoms, the composition to be administered, the 
age, weight, and response of the individual patient 
and the chosen route of administration. 

Example 1 

15 Identification of novel drug targets in C, albicans by 
anti- sense and disruptive integration 

The principle of the approach is based on the 
fact that when a particular C. albicans mRNA is 
-inhibit ed--by -producing- the complementary- anti ^ sense — 
20 RNA, the corresponding protein will decrease. If this 
protein is critical for growth or survival, the cell 
producing the anti-sense RNA will grow more slowly or 
will die. 

Since anti-sense inhibition occurs at mRNA level, 
25 the gene copy number is irrelevant, thus allowing 
applications of the strategy even in diploid 
organisms . 

Anti -sense RNA is endogenously produced from an 
integrative or episomal plasmid with an inducible 
30 promoter; induction of the promoter leads to the 
production of an RNA encoded by the insert of the 
plasmid. This insert will differ from one plasmid to 
another in the library. The inserts will be derived 
from genomic DNA fragments or from cDNA to cover- to 
35 the extent possible- the entire genome. 

The vector is a proprietary vector allowing 
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integration by homologous recombination at either the 
homologous insert or promoter sequence in the Candida 
genome. After introducing plasmids from cDNA or 
genomic libraries into C. albicans , transf ormants are 
5 screened for impaired growth after promoter (& thus 
anti-sense) induction in the presence of lithium 
acetate. Lithium acetate prolongs the Gl phase and 
thus allows anti -sense to act during a prolonged 
period of time during the cell cycle. Transf ormants 

10 which show impaired growth in both induced and non- 
induced media, thus showing a growth defect due to 
integrative disruption, are selected as well. 

Transf ormants showing impaired growth are 
supposed to contain plasmids which produce anti -sense 

15 RNA to mRNAs critical for growth or survival. Growth 
is monitored by measuring growth-curves over a period 
of time in a device (Bioscreen Analyzer, Labsystems) 
which allows simultaneous measurement of growth-curves 
of 200 transf ormants . 

20 --Subsequently plasmids can be recovered from the -- 

transf ormants and the sequence of their inserts 
determined, thus revealing which mRNA they inhibit. In 
order to be able to recover the genomic or cDNA insert 
which has integrated into the Candida genome, genomic 

25 DNA is isolated, cut with an enzyme which cuts only 
once into the library vector (and estimated approx. 
every 4 096 bp in the genome) and religated. PGR with 
primers flanking the insert will yield (partial) 
genomic or cDNA inserts as PGR fragments which can 

30 directly be sequenced. This PGR analysis (on ligation 
reaction) will also show us how many integrations 
occurred. Alternatively the ligation reaction is 
transformed to E. coli and PGR analysis is performed 
on colonies or on plasmid DNA derived thereof. 

35 This method is employed for a genome-wide search 

for novel C. albicans genes which are important for 
growth or survival . 
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Materials & Methods 

Construction of pGallPNiST-1 (SEQ ID NO: 21) 

The backbone of the pGALlPNiST-1 vector 
5 (integrative anti-sense Sfil-Notl vector) is 
pGE]yillZf( + ) (Promega Inc.). First, the CaMAL2 
EcoRl/Sall promoter fragment from pDBVSO (D.H. Brown 
et al , 1996) was ligated into EcoRI/ Sal I -opened 
pGEMllZf (+) resulting in the intermediate construct 

10 PGEMMAL2P-1 . Into the latter (Mscl/CIP) the CaURA3 
selection marker was cloned as a Eco47III/ZmnI 
fragment derived from pRM2 . The resulting pGEMMAL2P- 2 
vector was A^otl/Hindlll opened in order to accept the 
J\7otI-stuf fer-Sfil cassette from pPCKlNiSCYCT- 1 

15 (£;agrI/HindIII fragment): pMAL2PNiST- 1 . Finally, the 

plasmid pGALlPNiST-1 was constructed by exchanging the 
Sall/Fcil36II MAL2 promoter in pMAL2PNiST-l by the 
Xhol/Sma.! GALl promoter fragment derived from 

pRM2GALlP. - - - ' - 

20 

Construction of pGallPSiST-1 (SEQ ID ON:22) 

The vector pGALlPSiST-1 was created for cloning 
the small genomic DNA fragments (flanked by Sfil 
sites) behind the GALl promoter. The only difference 

25 with pGALlPNiST-1 is that the hlFNp (stuffer fragment) 
insert fragment in pGALlPSiST-1 is flanked by two Sfil 
sites in stead of a Sfil and a Notl site as in 
pGALlPNiST-1 . To construct pGALlPSiST-1 the ficoRI - 
Hindlll fragment, containing hlFNP flanked by a Sfil 

30 and a Notl site, of p]yiAL2pHiET-3 (unpublished) was 
exchanged by the EcoRI -Hindlll fragment, containing 
hlFNP flanked by two Sfil sites, from YCp50S-S (an E, 
coli / S. cerevisiae shuttle vector derived from the 
plasmid YCp50, which is deposited in the ATCC 

35 collection (number 37419; Thrash et al . , 1985); an 
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EcoRl-Hindlll fragment, containing the gene hlFNP, 
which is flanked by two Sfil sites, was inserted in 
YCp50, creating YCp50S-S) , resulting into plasmid 
pMAL2PSiST-l . The MAL2 promoter from pMAL2PSiST-l (by 
5 a i\^ael-jbail digest) was further replaced by the GALl 
promoter from pGALlPNiST-1 (via a Xhol-FSPl digest) , 
creating the vector pGALlPSiST- 1 . 

Candida, albicans genomic library 

10 * Preparation of the genomic DNA fragments 

A Candida albicans genomic DNA library with small DNA 
fragments (400 to 1,000 bp) was prepared. Genomic DNA 
of Candida albicans B2630 was isolated following a 
modified protocol of Blin and Stafford (1976) . The 

15 quality of the isolated genomic DNA was checked by gel 
electrophoresis. Undigested DNA was located on the gel 
above the marker band of 26,282 bp. A little smear, 
caused by fragmentation of the DNA, was present. 
To obtain enrichment for genomic DNA fragments of the 

20 desired size, the genomic DNA was partially digested. 
Several restriction enzymes (Alul, Haelll and Rsal ; 
all creating blunt ends) were tried out. The 
appropriate digest conditions have been determined by 
titration of the enzyme. Enrichment of small DNA 

25 fragments was obtained with 70 units of Alul on 10 )ig 
of genomic DNA for 2 0 min. T4 DNA polymerase 
(Boehringer) and dNTPs (Boehringer) were added to 
polish the DNA ends. After extraction with phenol- 
chloroform the digest was size-fractionated on an 

30 agarose gel. The genomic DNA fragments with a length 
of 500 to 1,250 bp were eluted from the gel by 
centrifugal filtration (Zhu et al . , 1985). Sfil 
adaptors (5' GTTGGCCTTTT) or (5' AGGCCAAC) were 
attached to the DNA ends (blunt) to facilitate cloning 

35 of the fragments into the vector. Therefore, a 8-mer 
and 11-mer oligonucleotide (comprising the Sfil site) 
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were kinased and annealed. After ligation of these 
adaptors to the DNA fragments a second size- 
f ractionation was performed on an agarose gel . The 
DNA fragments of 400 to 1150 bp were eluted from the 
5 gel by centrifugal filtration. 

* Preparation of the pGALlPSiST-1 vector fragment 
The small genomic DNA fragments were cloned after 

the GALl promoter in the vector pGALlPSIST- 1 . Qiagen- 
purified pGALlPSiST-1 plasmid DNA was digested with 

10 Sfil and the largest vector fragment eluted from the 
gel by centrifugal filtration (Zhu et al , , 1985) . 
Ligation with a control DNA fragment, flanked by Sfil 
sites, was performed as a control. The ligation mix 
was electroporated to MC1061 E, coli cells. Plasmid 

15 DNA of 24 clones was analyzed. In all cases the 

control fragment was inserted in the pGALlPSiST-1 
vector fragment . 

* Upscaling 

All genomic DNA fragments (450 ng) were ligated 
20 into -the pGALlPSiST— 1 vector- (2 0 - ng) . Af ter - 

electroporat ion at 2500V, 40|liF circa 400,000 clones 
were obtained. These clones were pooled into three 
groups and stored as glycerol slants. Also Qiagen- 
purified DNA was prepared from these clones. A clone . 
25 analysis showed an average insert length of 600 bp and 
a percentage of 91 for clones with an insert. The size 
of the library corresponds to 5 times the diploid 
genome. The genomic DNA inserts are sense or anti- 
sense orientated in the vector. 

30 

Candida, albicans cDNA library 

Total RNA was extracted from Candida albicans 
B2630 grown on respectively minimal (SD) and rich 
(YPD) medium as described by Chirgwin et al . in 
35 Sambrook et al 1996. mRNA was prepared from total RNA 
using the Invitrogen Fast Track procedure. 

First strand cDNA is synthesised with the 
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Superscript Reverse Transcriptase (BRL) and with an 
oligo dT-Notl Primer adapter. After second strand 
synthesis, cDNA is polished with Klenow enzyme and 
purified over a Sephacryl S-400 spun column. 
5 Phosphorylated Sfil adapters are then ligated to the 
cDNA, followed by digestion with the Notl restriction 
enzyme. The Sfil/Notl cDNA is then purified and sized 
on a Biogel column A150M. 

First fraction contains approximately 38,720 
10 clones by transformation, the second fraction only 
1540 clones. Clone analysis: 

Fr. I: 22/24 inserts, 16 ; 1000 bp, 4 ; 2000 bp, 
average size: 15 0 0 bp. 

Fr. II: 9/12 inserts, 3 ; 1000 bp, average size: 960 
15 bp cDNA was ligated in a Notl/ Sfil opened pGALlPNiST-1 
vector (anti-sense) 

Can dlda t r an s f o rma t i on 

The host strain used for transformation is a C. 

20 ,aTbica.ns "ura3 mutant, CAI-4, which contains a deletion 
in orotidine-5=-phosphate decarboxylase and was 
obtained from William Fonzi, Georgetown University 
(Fonzi and Irwin) . CAI-4 was transformed with the 
above described cDNA library or genomic library using 

25 the Pichia. spheroplast module (Invitrogen) . Resulting 
transf ormants were plated on minimal medium 
supplemented with glucose (SD, 0.67% or 1.34% Yeast 
Nitrogen base w/o amino acids + 2% glucose) plates 
and incubated for 2-3 days at 3 0°C. 

30 

Screening for mutants 

Starter cultures were set up by inoculating each 
colony in 1 ml SD medium and incubating overnight at 
30°C and 300 rpm. Cell densities were determined using 
35 a Coulter counter (Coulter Zl; Coulter electronics 

limited). 250.000 cells/ml were inoculated in 1 ml SD 
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medium and cultures were incubated for 24 hours at 
3 0°C and 300 rpm. Cultures were washed in minimal 
medium without glucose (S) and the pellet resuspended 
in 650 |al S medium. 8 fil of this culture is used for 
5 inoculating 400 fil cultures in a Honeywell - 100 plate 
(Bioscree^ analyzer; Labsystems) . Each transformant 
was grown during three days in S medium containing 
LiAc; pH 6.0, with 2% glucose/2% maltose or 2% 
galactose/2% maltose respectively while shaking every 

10 3 minutes for 20 seconds. Optical densities were 

measured every hour during three consecutive days and 
growth curves were generated (Bioscreen analyzer ; 
Labsystems) . 

Growth curves of transf ormants grown in 

15 respectively anti-sense non-inducing (glucose/maltose) 
and inducing (galactose/maltose) medium are compared 
and those transf ormants showing impaired growth upon 
anti -sense induction are selected for further 
analysis. Transf ormants showing impaired growth by 

20 virtue of integration into a critical gene are also 
selected. 

Isolation of genomic or cDNA inserts 

Putatively interesting transf ormants are grown in 
25 1 . 5 ml SD overnight and genomic DNA is isolated using 
the Nucleon Ml Yeast kit (Clontech) . Concentration of 
genomic DNA is estimated by analyzing a sample on an 
agarose gel. 

2 0 ng of genomic DNA is digested for three hours 
30 with an enzyme that cuts uniquely in the library 

vector (Sad for the genomic library; PstI for the 
cDNA library) and treated with RNAse. Samples are 
phenol/chloroform extracted and precipitated using 
NaOAc/ethanol . 
35 The resulting pellet is resuspended in 500 |li1 

ligation mixture (1 x ligation buffer and 4 units of 
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T4 DNA ligase; both from Boehringer) and incubated 
overnight at 16*^0. 

After denaturation (20 min 65°C) , purification 
(phenol/chloroform extraction) and precipitation 
5 (NaOAc/ethanol ) the pellet is resuspended in 10 ^1 
MilliQ (Millipore) water. 

PGR analysis 

Inverse PCR is performed on 1 /xl of the 

10 precipitated ligation reaction using library vector 

specific primers (oligo23 5' TGC-AGC-TCG-ACC-TCG-ACT-G 
3' SEQ ID NO: 17 and oligo25 5' GCG-TGA-ATG-TAA-GCG- 
TGA-C 3' SEQ ID NO: 18 for the genomic library; 
3pGALNistPCR primer: 5 ' TGAGCAGCTCGCCGTCGCGC 3' SEQ ID 

15 NO: 19 and SpGALNistPCR primer: 

5 ' GAGTTATACCCTGCAGCTCGAC 3' SEQ ID NO : 2 0 for the cDNA 
library; both from Eurogentec) for 30 cycles each 
consisting of (a) 1 min at 95 °C, (b) 1 min at 57 ""C, 
and (c) 3 min at 72 °C . In the reaction mixture 2.5 

20 units of Taq polymerase (Boehringer) with TaqStart 
antibody (Clontech) (1:1) were used, and the final 
concentrations were 0.2 /iM of each primer, 3 mM MgCl2 
(Perkin Elmer Cetus) and 200 \m dNTPs (Perkin Elmer 
Cetus) , PCR was performed in a Robocycler 

25 (Stratagene) . 

Sequence determination 

Resulting PCR products were purified using PCR 
purification kit (Qiagen) and were quantified by 

30 comparison of band intensity on EtBr stained agarose 

gel with the intensity of DNA marker bands. The amount 
of PCR product (expressed in ng) used in the 
sequencing reaction is calculated as the length of the 
PCR product in basepairs divided by 10 . Sequencing 

35 reactions were performed using the ABI Prism BigDye 
Terminator Cycle Sequencing Ready Reaction Kit 
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according to the instructions of the manufacturer (PE 
Applied Biosystems, Foster City, CA) except for the 
following modifications . 

The total reaction volume was reduced to 15 j^l . 
5 Reaction volume of individual reagents were changed 

accordingly. 6.0 fil Terminator Ready Reaction Mix was 
replaced by a mixture of 3.0 |_il Terminator Ready 
Reaction Mix + 3 . 0 |al Half Term (GENPAK Limited, 
Brighton, UK) . After cycle sequencing, reaction 

10 mixtures were purified over Sephadex G50 columns 

prepared on Multiscreen HV opaque microtiter plates 
(Millipore, Molsheim, Fr) and were dried in a 
speedVac . Reaction products were resuspended in 3 fil 
loading buffer. Following denaturation for 2 min at 

15 95°C, 1 /il of sample was applied on a 5% Long Ranger 
Gel (36 cm well -to-read) prepared from Singel Packs 
according to the supplier's instructions {FMC 
BioProducts, Rockland, ME) . Samples were run for 7 
hours 2X run on a ABI 3 7 7XL DNA sequencer. Data 

20 collection version 2.0 and Sequence analysis version 
3.0 (for basecalling) software packages are from PE 
Applied Biosystems, Resulting sequence text files 
were copied onto a server for further analysis. 

25 Sequence analysis 

Nucleotide sequences were imported in the 
VectorNTI software package (InforMax Inc , North 
Bethesda, MD, USA) , and the vector and insert regions 
of the sequences were identified. Sequence similarity 

30 searches against public and commercial sequence 
databases were performed with the BLAST software 
package (Altschul et al . , 1990) version 1.4. Both the 
original nucleotide sequence and the six- frame 
conceptual translations of the insert region were used 

35 as query sequences. The used public databases were the 
EMBL nucleotide sequence database (Stoesser et al . , 
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1998) , the SWISS-PROT protein sequence database and 
its supplement TrEMBL (Bairoch and Apweiler, 1998) , 
and the ALCES Candida albicans sequence database 
(Stanford University, University of Minnesota) . The 
5 commercial sequence databases used were the LifeSeq® 
human and PathoSeq® microbial genomic databases 
(Incyte Pharmaceuticals Inc., Palo Alto, CA, USA), and 
the GENESEQ patent sequence database (Derwent, London, 
UK) . Three major results were obtained on the basis of 

10 the sequence similarity searches: function, novelty, 
and specificity. A putative function was deduced on 
the basis of the similarity with sequences with a 
known function, the novelty was based on the absence 
or presence of the sequences in public databases, and 

15 the specificity was based on the similarity with 
vertebrate homologues . 

Methods 

Blastx of the nucleic acid sequences against the 
-20 - appropriate protein databases:- Swi-ss-Prot for elones- 
of which the complete sequence is present in the 
public domain, and paorfp (PathoSeq™) for clones of 
which the complete sequences is not present in the 
public domain. 

25 The protein to which the translated nucleic acid 

sequence corresponds to is used as a starting point. 
The differences between this protein and our 
translated nucleic acid sequences are marked with a 
double line and annotated above the protein sequence. 

30 The following symbols are used: 

A one-letter amino acid code or the ambiguity 
code X is used if our translated nucleic acid sequence 
has another amino acid on a certain position. 

The stop codon sign * is used if our translated 

35 nucleic acid sequence has a stop codon on a certain 
position, 

The letters fs (frame shift) are used if a frame 
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shift occurs in our translated nucleic acid sequence, 
and another reading frame is used. 

The words ambiguity or ambiguities are used if a 
part of our translated nucleic acid sequence is 
5 present in the proteins, but not visible in the 
alignments of the blast results. 

The phrase ''missing sequence" is used if the 
translated nucleic acid sequence does not comprise 
that part of the protein. 
10 Blastx: compares the six-frame conceptual 

translation products of a nucleotide query sequence 
(both strands) against a protein sequence database. 
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Ant i sense Experiments 



Clones 383c_cp, 392c_cp, 417c_cpG2L, 325c_af, 
322c_cp, 26g3, and 409c_cp were transformed with 
plasmid pGALlPSiST-1 containing the galactose 
inducible promotor/expression cassette. The plasmid 

-20 - was -modif ied to include ant isense molecules capable of 
binding to RNA expressed from DNA corresponding to SEQ 
ID NOs: 1-2 and SEQ ID NOs 4-8 natively present the 
Candida, albicans cell. Following transformation the 
cells were grown in the presence of glucose or 

25 galactose to determine the effect of the induced 
antisense molecule on protein expression and on 
Candida albicans cell growth. An exemplary growth 
curve is provided in Figure 5. All cells transformed 
with the antisense molecules demonstrated significant 

30 growth reduction. 

Gene Knock-outs 

To verify that the growth effect was due to the 
interference with the identified gene and to support 
35 the specificity of the antisense effect, single and 
double allele knock-outs were made in the identified 
genes using the URA-blaster method (Fonzi and Irwin 
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1993) . Figure 4 provides an example of a growth curve 
for a double allele knock-out as compared to the wild- 
type strain CAI4 . Growth curves are shown as the 
measurement of optical density (Y-axis) over time in 
5 hours (X-axis) . The. optical density is a measure for 
the number of cells present and this a measure for 
growth. A reduction in growth was observed for this 
double knock-out as compared to ; the wild- type strain. 

10 Screening for compounds modulating expression of 

polypeptides critical for growth and survival of C. 
albicans 

The method proposed is based on observations 
(Sandbaken et al . , 1990; Hinnebusch and Liebman 1991; 

15 Ribogene PCT WO 95/11969, 1995) suggesting that 

underexpression or overexpression of any component of 
a process (e.g. translation) could lead to altered 
sensitivity to an inhibitor of a relevant step in that 
process. Such an inhibitor should be more potent 

20 against a cell limited by a deficiency in the 

macromolecule catalyzing that step and/or less potent 
against a cell containing an excess of that 
macromolecule, as compared to the wild type (WT) cell. 
Mutant yeast strains, for example, have shown 

25 that some steps of translation are sensitive to the 

stoichiometry of macromolecules involved. (Sandbaken 
et al. 1996) . Such strains are more sensitive to 
compounds, which specifically perturb translation (by 
acting on a component that participates in 

30 translation) but are equally sensitive to compounds 
with other mechanisms of action. 

This method thus not only provides a means to 
identify whether a test compound perturbs a certain 
process but also an indication of the site at which it 

35 exerts its effect. The component which is present in 
altered form or amount in a cell whose -growth is 
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affected by a test compound is potentially the site of 
action of the test compound. 

The assay involves measurement of growth of an 
isogenic strain which has been modified only in a 
5 certain specific allele, relative to a wild type (WT) 
C. albicans strain, in the presence of R-compounds. 
Strains can be ones in which the expression of a 
specific essential protein is impaired upon induction 
of anti -sense or strains which carry disruptions in an 

10 essential gene. An in silico approach to finding 
novel essential genes in C. albicans will be 
performed. A number of essential genes identified in 
this way will be disrupted (in one allele) and the 
resulting strains can be used for comparative growth 

15 screening. 

Assay for High Throughput screening for drugs 

35 |al minimal medium (S medium + 2% galactose + 
2% maltose) is transferred in a transparent flat- 
20 bottomed" 96 well plate using- an automated pipetting 

system (Multidrop, Labsystems) . A 96-channel pipettor 
(Hydra, Robbins Scientific) transfers 2 . 5 fil of R- 
compound at 10"^ M in DMSO from a stock plate into the 
assay plate. 

25 The selected C. albicans strains (mutant and 

parent (CAI-4) strain) are stored as glycerol stocks 
(15%) at -70°C. The strains are streaked out on 
selective plates (SD medium) and incubated for two 
days at 30°C. For the parent strain, CAI-4, the medium 

30 is always supplemented with 2 0 |j,g/ml uridine. A single 
colony is scooped up and resuspended in 1 ml minimal 
medium (S medium + 2% galactose + 2% maltose) . Cells 
are incubated at 30^C for 8 hours while shaking at 
250 rpm. A 10 ml culture is inoculated at 250,000 

35 cells/ml. Cultures are incubated at 30°C for 24 

hours while shaking at 250 rpm. Cells are counted in 
Coulter counter and the final culture (S medium +2% 
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galactose + 2% maltose) is inoculated at 20,000 to 
50,000 cells/ml. Cultures are grown at 30°C while 
shaking at 250 rpm until a final OD of 0.24 (+/- 0.04) 
600nM is reached. 
5 200 |i,l of this yeast suspension is added to all 

wells of MW96 plates containing R-compounds in a 450 
(or 250) |il total volume. MW96 plates are incubated 
(static) at 30°C for 48 hours. 

Optical densities are measured after 48 hours. 
10 Test growth is expressed as a percentage of 

positive control growth for both mutant (x) and wild 
type (y) strains. The ratio (x/y) of these derived 
variables is calculated. 
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Table 1 



Seq ID No. 


Clone 


Function 


1 


3 82c_cp 




2 


3 92c_cp 


TUFl 
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3 




RAD5 3 


4 


417c_cpG2L 




5 


3 2 5c_af 




6 


3 2 2c_cp^ 




7 


2 6g3 




8 


4 0 9c_cp 




9 


382c_cpGlL2 




10 


382c_cp (prt) 




11 


3 92c_cp (prt) 


TUFl 


12 




RAD53 


13 


325c_af (prt)^ 




14 


322c_cp (prt)^ 




15 


26g3 (prt) 




16 


417c_cp 92L (prt) 





1. 322c-cp is a member of the UPF0057 protein 
family. It contains potential transmembrane 
regions (6-23aa; 30-53aa) and could be low 

5 temperature or salt-stress inducible. 

2. 325c-af shows similarity to IMP4 yeast and 
related proteins and it might be involved in rRNA 
processing in Candida albicans in a similar way 

10 to IMP4. 
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What is claimed is: 

1. A nucleic acid molecule encoding a polypeptide 
which is critical for survival and growth of the yeast 

5 Candida albicans and which nucleic acid molecule is 
selected from the group consisting of SEQ ID NO: 1 to 
SEQ ID NO: 9. 

2. A nucleic acid molecule encoding a polypeptide 

10 which is critical for survival and growth of the yeast 
Candida albicans and which nucleic acid molecule is 
selected from the group consisting of SEQ ID NO: 1 to 
SEQ ID NO: 3 . 

15 3 . A nucleic acid molecule encoding a polypeptide 

which is critical for survival and growth of the yeast 
Candida albicans and which nucleic acid molecule is 
selected from SEQ. ID NO: 1 or SEQ. ID. NO: 2 and 
fragments or derivatives of said nucleic acid 

20 molecules . - — - — 

4. The nucleic acid molecule of Claim 1 which is mRNA. 

5. The nucleic acid molecule of Claim 1 which is DNA. 

25 

6. The nucleic acid molecule according to claim 5 
which is cDNA. 

7. A first nucleic acid molecule capable of 

30 hybridising to a second nucleic acid molecule encoding 
a polypeptide which is critical for survival and 
growth of the yeast Candida albicans and which second 
nucleic acid molecule is selected from the group 
consisting of SEQ ID NO: 1 to SEQ ID NO : 9 wherein 

35 hybridsation occurs under high stringency conditions. 



An antisense molecule comprising a nucleic acid 
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molecule capable of hybridising to a nucleic acid 
molecule encoding a polypeptide which is critical for 
survival and growth of the yeast Candida albicans 
wherein the nucleic acid molecule is selected from the 
5 group consisting of SEQ ID NO: 1 to SEQ ID NO: 9. 



9. Cells containing a nucleic acid molecule encoding 
a polypeptide which is critical for survival and 
growth of the yeast Candida albicans and which nucleic 

10 acid molecule is selected from the group consisting of 
SEQ ID NO: 1 to SEQ ID NO: 9, wherein said cells are 
bacterial or eukaryotic. 

10. A polypeptide encoded by a nucleic acid molecule 
15 which is critical for survival and growth of the yeast 

Candida albicans and which nucleic acid molecule is 
selected from the group consisting of SEQ ID NO: 1 to 
SEQ ID NO: 9. 

20 11-. — -A- polypet ide having an amino acid sequence 

selected from the group consisting of SEQ ID NO: 10 to 
SEQ ID NO: 16 . 



12. A recombinant DNA construct comprising a nucleic 
25 acid molecule encoding a polypeptide which is critical 

for survival and growth of the yeast Candida albicans 
and which nucleic acid molecule is selected from the 
group consisting of SEQ ID NO: 1 to SEQ ID NO: 9, 
wherein the nucleic acid molecule is DNA. 

30 

13 . A recombinant DNA construct comprising a first 
nucleic acid molecule wherein the first nucleic acid 
molecule or a molecule complementary to the first 
nucleic acid molecule is capable of hybridizing a 

35 second nucleic acid molecule encoding a polypeptide 

which is critical for survival and growth of the yeast 
Candida albicans and which second nucleic acid 
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molecule is selected from the group consisting of SEQ 
ID NO: 1 to SEQ ID NO : 9 wherein the first nucleic 
acid molecule is inserted in the antisense 
orientation . 

5 

14. A recombinant DNA construct according to claim 12 
wherein said recombinant DNA construct is an 
expression vector. 

10 15. The construct according to claim 14 which 
comprises an inducible promoter. 

16. The construct according to claim 14 which 
comprises a sequence encoding a reporter molecule. 

15 

17. Cells containing a recombinant DNA construct 
comprising a nucleic acid molecule encoding a 
polypeptide which is critical for survival and growth 
of the yeast Candida albicans and which nucleic acid 

20 molecule -is selected f rom -the - group -consisting of SEQ 
ID NO: 1 to SEQ ID NO : 9, wherein said cells are 
bacterial or eukaryotic. 

18. The nucleic acid molecule according to claim 1 in 
25 a pharmaceutically acceptable carrier, diluent, 

excipient or buffer. 

19. A method for treating a Candida albicans- 
associated disease comprising the step of: 

30 administering a composition of matter comprising an 

antisense nucleic acid molecule capable of binding to 
a Candida alJbi cans-originating nucleic acid sequence 
selected from the group consisting of SEQ ID NO : 1 to 
SEQ ID NO: 9 . 

35 

20. A pharmaceutical composition comprising a 
polypeptide according to claim 10 together with a 
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pharmaceut ically acceptable carrier diluent or 
excipient therefor . 



21. A Ca.ndida. albicans cell comprising an induced 

5 mutation in the DNA sequence encoding the polypeptide 
according to claim 10. 

22. A method of identifying compounds which 
selectively modulate expression or functionality of 

10 polypeptides or metabolic pathways in which these 

polypeptides are involved and which are crucial for 
growth and survival of Candida albicans ^ which method 
comprises: 

(a) contacting a compound to be tested with one 
15 or more Candida albicans cells having a 

mutation in a nucleic acid molecule 
according to claim 1 which mutation results 
in overexpression or underexpression of said 
polypeptides in addition to contacting one 
20 - - _ _ or more wild type Candida albicans cells 

with said compound; and 

(b) monitoring the growth and/or activity of 
said mutated cell compared to said wild 
type; wherein differential growth or 

25 activity of said one or more mutated Candida 

cells is indicative of selective action of 
said compound on a polypeptide or another 
polypeptide in the same or a parallel 
pathway . 



30 



23. A compound identifiable according to the method 
of claim 22 . 



24. A pharmaceutical composition comprising a 
35 compound according to claim 23 together with a 
pharmaceutically acceptable carrier, diluent or 
excipient therefor. 
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25. A method- of identifying DNA sequences from a cell 
or organism which DNA encodes polypeptides which are 
critical for growth or survival of said cell or 
5 organism, which method comprises: 

(a) preparing a cDNA or genomic library from 
said cell or organism in a suitable 
expression vector which vector is such that 
it can either integrate into the genome in 

10 said cell or that it permits transcription 

of antisense RNA from the nucleotide 
sequences in said cDNA or genomic library; 
and 

(b) selecting transf ormants exhibiting impaired 
15 growth and determining the nucleotide 

sequence of the cDNA or genomic sequence 
from the library included in the vector from 
said transf ormant . 



- 20 2 6 .- -The method aeeording- to claim 2 5 wherein said 

cell or organism is a yeast or filamentous fungus. 

27. The method according to claim 25 wherein said 
cell or organism is selected from the group consisting 
25 of SaccharomycGs cerevisisie , Schizosaccharomyces pombe 
or Candida albicans , 



28. An antibody capable of binding to a polypeptide 
encoded by a nucleic acid molecule which is critical 

30 for survival and growth of the yeast Candida albicans 
and which nucleic acid molecule is selected from the 
group consisting of SEQ ID NO: 1 to SEQ ID NO : 9. 

29. An oligonucleotide comprising a fragment of from 
35 10 to 120 contiguous nucleotides of a nucleic acid 

molecule which is critical for survival and growth of 
the yeast Candida albicans and which nucleic acid 



molecule is selected from the group consisting of 
ID NO: 1 to SEQ ID NO : 9. 

30. The oligonucleotide according to claim 29 
comprising a fragment of from 10 to 50 contiguous 
nucleotides . 
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ABSTRACT 

DRUG TARGETS IN CANDIDA ALBICANS 

The present invention is concerned with the 
identification of genes or functional fragments 
thereof from Candida albicans which are critical for 
growth and cell division and which genes may be used 
as selective drug targets to treat Candida, albicans 
associated infections. Novel nucleic acid sequences 
from Candida albicans are also provided and which 
encode the polypeptides which are critical for growth 
of Candida albicans. Methods for the ' identification of 
anti- fungal compounds which inhibit fungal or yeast 
growth are also contemplated. 
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I hereby claim foreign priority benefits under 35 U.S.C. 119{a)-{d) or 365(b) of any foreign appl)cation{s) for patent or inventor's 
certificate, or 365(a) of any PCT international application which designated at least one country other than the United States of 
America, listed below and have also identified below, by checking the box, any foreign application for patent or inventor's certificate, 
or of any PCT international application having a filing date before that of the application on which priority Is claimed. 



Prior Foreign Application 
Number(s) 



Country 



Foreign Filing Date 
(MM/DD/YYYY) 



Priority 
Not Claimed 



Certified Copy Attached? 
YES Ei£2 , 



98204122.0 



EP 



12/04/1998 



□ 
□ 
□ 
□ 



□ 
□ 
□ 
□ 



□ 
□ 
□ 
□ 



D Additional foreign application numbers are listed on a supplemental priority data sheet PTO/SB/Q2B attached hereto: 
I hereby claim the benefit under 35 U.S.C. 1 19fe) of any United States provisional application(s) listed below. 



Application Number(s) 



Filing Date (MM/DD/YYYY) 



I j Additional provisional application 
numbers are listed on a 
supplemental priority data sheet 
PTO/SB/02B attached hereto. 
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Burden Hour Statement: This form is estimated to take 0.4 hours to complete. Time will vary depending upon the needs of the 
individual case. Any comments on the amount of time you are required to complete this form should be sent to the Chief Information 
Officer, Patent and Trademark Office, Washington. DC 20231. DO NOT SEND FEES OR COMPLETED FORMS TO THIS 
ADDRESS. SEND TO: Assistant Commissioner for Patents, Washington, DC 20231. 



Please type a plus sign {+) inside this box 



PTO/SB/01 (12-97) _ 
Approved for use through 9/30/00. OMB 0651-0032 ' 
Patent and Trademark Office; U.S. DEPARTMENT OF COMMERCE 
Under the Paperwork Reduction Act of 1995, no persons are required to respond to a collection of information unless it contains 
a valid OMB control number. 



DECLARATION — Utility or Design Patent Application 



I hereby claim the benefit under 35 U.S.C. 120 of any United States applicatton(s). or 365(c) of any PCT international application designating the 
United States of America, listed below and, insofar as the subject matter of each of the claims of this application is not disclosed in the prior 
United States or PCT International application in the manner provided by the first paragraph of 35 U.S.C. 112, I acknowledge the duty to disclose 
information which is material to patentability as defined in 37 CFR 1 .56 which became available between the filing date of the prior application 
and the national or PCT international filing date of this application. 



U.S. Parent Application or PCT Parent 
Number 


Parent Filing Date 
(MM/DD/YYYY) 


Parent Patent Number 
(If applicable) 








1 1 Additional U.S. or PCT international application numbers are listed on a supplemental priority data sheet PTO/SB/02B attached hereto. 



As a named inventor, I hereby appoint the following registered practi tioner(s) to prosecute this app lication and to tra nsact all business in the Paten t 
and Trademark Office connected therewith: □ Customer Number [ 

OR 



D Registered practitiQner(s) name/registration number listed below 



Ptaca Customer 
Number Bar Code 
i ahRi hftrei 



Name 


Registration 
Number 


Name 


Registration 
Number 


Steven P. Berman 
Andrea L. Colby 


r 

.24,IZ2_ 
30J94_ 


/lichaet Stark \ 

Ellen C. Coletti 
Mary A. Appollina 


34.140 
34.08Z. 



L!l Additional registered practitionerfs) named on supplemental Registered Practitioner Information sheet PTO/SB/02C attached hereto 



Direct all correspondence to: □ Customer Number 

or Bar Code Label 



OR [3 Correspondence address below 



Name 



Phili2.S,jDbD§on 



Address 



J qhn5Dn-&-Jetwsen 



Address 



OneJiDbF^n-SriJohnsefv-Raza" 



City 



Country 



'CJSA 



Telephone 



State 



NJ 



(732) 524-2359 



ZIP 



Fax 



08933-7003 



(732) 524-2808 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are 
believed to be true; and further that these statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under 18 U.S.C. 1001 and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon. 



Name of SolejpKFirst Inventor 




□ A petition has been filed for this unsigned inventor 



Giyei 



first and middle Tif any]) 



Family NanriP fir Surname 



^"^oland Henri 



Contreras 



Inventor's 
Signature 






Date 




Residence: City 


9000 Gent 3^,, 


coumrv Belgium 


Citizenship 


BE 


Post Office Address 


c/o University of Gent, K.L. Ledeganckstraat 35, B-9000 Gent, Belgium 


Post Office Address 




City Schelderoc 


e/Merelbek^tate zip 


9820 1 Country 1 "^Belgium 



D Additional inventors are being named on the supplemental Additional Inventor(s) sheet(s) PTO/SB/02A attached hereto 
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o 'Qi-e 7 '7' :E:: 



111 O H i|j J 



Please type a plus sign (+) inside this box 



PTO/SB/02A (3-97) 
Approved for use through 9/30/98. OMB 0651-0032 
Patent and Trademark Office; U.S. DEPARTMENT OF COMMERCE 
Under the Paperwork Reduction Act of 1995, no persons are required to respond to a collection of information unless it contains a 
valid OMB control number. 
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DECLARATION 



ADDITIONAL INVENTOR(S) 
Supplemental Sheet 

Page _L of 2 




Name QfAdditional Jointinventor, If any 



I I A petition has been filed for this unsigned inventor 



Post Office Address 



do Janssen Pharmaceutica N.V., Turnhoutseweg 30, B-2340 Beerse, Belgiurr 



Post Office Address 



City 



Meerhout 



2450 



Country 



Belgium 



Name of Additional Joint Inventor, if any: 



n A petition has been filed for this unsigned inventor 



Given Nanr>e (first and middle [if any]) 



Fannily Uame or Surname 



Marianne Denise 



De Backer 



Inventor's 
Signature 



Date 



Residence: City 



Beerse 



state 



Country 



Belgium 



Citizenship 



BE 



Post Office Address 



c/o Janssen Pharmaceutica N.V., Turnhoutseweg 30, B-2340 Beerse, Belgiun 



Post Office Address 



City 



San Diego 



Name of Ac^dlH^nal Joint Inventor, if any: 




CA 



ZIP 



92122 



Country 



USA 



r~l A petition has been filed for this unsigned inventor 



jyenlName (first and middle [if any]) 



Family Nanrve or Surname 



Walter Herman Maria Lquls 



Luyten^ 




Post Office Address 



c/o Janssen Pharmaceutica N.V., Turnhoutseweg 30, B-2340 Beerse, Belgiun 




Post Office Address 



City 



Turnhout 



state 



2300 



Country 



Belgium 



Burden Hour Statement: This form is estimated to take 0.4 hours to complete. Time will vary depending upon the needs of the individual case. Any 
comments on the amount of time you are required to complete this form should be sent to the Chief Information Officer. Patent and Trademark 
Office. Washington. DC 20231. DO NOT SEND FEES OR COMPLETED FORMS TO THIS ADDRESS. SEND TO: Assistant Commissioner for 
Patents. Washington, DC 20231. 



Please type a plus sign (+) inside this box 
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DECLARATION 



ADDITIONAL INVENTOR(S) 
Supplemental Sheet 
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Name of Additional Joint Inventor, if any: 



I I A petition has been filed for this unsigned inventor 



Given Name (first and nriiddle [if any]) 



Family Nanrie or Surname 



Jasmine Elza 



Viaene 



Inventor's 
Signature 



Date 



Residence: City 



Gent 



State 



Country 



Belgium 



citizenship 



BE 



Post Office Address 



c/o University of Gent, K.L. Ledeganckstraat 35. B-9000 Gent. Belgium 



Post Office Address 



City 



Varsenare 



state 



8490 



Country 



Belgium 



Name of Additional Joint Inventor, if any: 



[~| A petition has been filed for this unsigned inventor 



Given Name (first and middle [if any]) 



Family Name or Surname 



Marc George 



Logghe 



inventor's 
Signature 



Date 



Residence: City 



Gent 



Country Belgium citizenship | BE 



Post Office Address 



c/o University of Gent, K.L. Ledeganckstraat 35, B-9000 Gent, Belgium 



Post Office Address 



City 



St.-Denijs-Westrem 



state 



Name of Additional Joint Inventor, if any: 



ZIP 



9051 



Country 



Belgium 



I 1 A petition has been filed for this unsigned inventor 



Given Name (first and middle JLapy]) 



Family Name or Sumame 



JorgjS Eduardo 



Vialard 



inventor's 
Signature 




Date 



Residence: City 



Beerse 



Country Belgium 



Citizenship 



CA 



Post Office Address 



c/o Janssen Pharmaceutica N.V.. Turnhoutsevyeg 30. B-234Q Beerse, Belgiurr 



Post Office Address 



City 



Brussel 



state 



ZIP 



1200 



Country 



Belgium 



I- 



Burden Hour Statement; This form is estimated to take 0.4 hours to complete. Time will vary depending upon the needs of the individual case. Any 
comments on the amount of time you are required to complete this form stiould be sent to the Chief Information Officer. Patent and Trademark 
Office. Washington. DC 20231. DO NOT SEND FEES OR COIVIPLETED FORMS TO THIS ADDRESS. SEND TO: Assistant Commissioner for 
Patents, Washington, DC 20231 . 



Please type a plus sign (+) inside this box - 
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DECLARATION 



ADDITIONAL fNVENTOR(S) 
Supplemental Sheet 
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Name of Additional Joint Inventor, if any: 



I I A petition has been filed for this unsigned inventor 



Given Name (first and middle [if any]) 



Family Name or Surname 



Bart Jozef Maria 



Nelissen 



Inventor's 
Signature 



Date 



Residence: City 



Beerse 



state 



Country 



Belgium 



citizenship 



BE 



Post Office Address 



c/o Janssen Pharmaceutica N.V., Turnhoutseweg 30, B-2340 Beerse, Beigiunr 



Post Office Address 




Post Office Address 



c/o Janssen Pharmaceutica N.V., Turnhoutseweg 30, B-2340 Beerse, Belgiun 



Post Office Address 



City 



San Diego 



state 



Name of Additional Joint Inventor, If any: 



CA 



92122 



Country 



USA 



I I A petition has been filed for this unsigned inventor 



Given Name (first and middle [if any]) 



Family Name or Surname 



Walter Herman Maria Louis 



Luyten 



Inventor's 
Signature 



Date 



Residence: City 



Beerse 



state 



Country 



Belgium 



Citizenship 



BE 



Post Office Address 



c/o Janssen Pharmaceutica N.V., Turnhoutseweg 30, B-2340 Beerse, Belgiun 



Post Office Address 



City 



Turnhout 



state 



ZIP 



2300 



Country 



Belgium 



+ 



Burden Hour Statement: This form is estimated to take 0.4 hours to complete. Time will vary depending upon the needs of the individual case. Any 
comments on the amount of time you are required to complete this form should be sent to the Chief Information Officer, Patent and Trademark 
Office. Washington. DC 20231. DO NOT SEND FEES OR COMPLETED FORMS TO THIS ADDRESS. SEND TO: Assistant Commissioner for 
Patents. Washington. DC 20231 . 
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DECLARATION 



ADDITIONAL iNVENTOR(S) 
Supplemental Sheet 

Page _2_ of 2_ 



Family Name or Surname 




Name of Additional Joint Inventor, If any: 



^6lgerrNag]&^(fji[st middle [if any]) 



I I A petition has been filed for this unsigned inventor 



Post Office Address 



Post Office Address 



City 



c/o University of Gent, K.L. Ledeganckstraat 35, B-9000 Gent, Belgium 



Varsenare 



8490 



Country 



Belgium 



Name of Additional Joint Inventor, if any: 



I I A petition has been filed for this unsigned Inventor 




Post Office Address 



Post Office Address 



City 



c/o University of Gent, K.L. Ledeganckstraat 35, B-9000 Gent, Belgium 



St.-Denijs-Westrem 



state 



Name of Additional Joint Inventor, if any: 



ZIP 



9051 



Country 



Belgium 



n A petition has been filed for this unsigned inventor 



Given Name (first and middle [If any]) 



Jorge Eduardo 



Inventor's 
Signature 



Residence: City 



Post Office Address 



Post Office Address 



City 



Family Nanr^ or Surname 



Vialard 



Beerse 



state 



Country I Belgium 



Date 



Citizenship 



CA 



c/o Janssen Pharmaceutica N.V., Turnhoutseweg 30. B-2340 Beerse, Belgiunr 



Brussel 



state 



1200 



Country 



Belgium 



+ 



Burden Hour Statement: This form is estimated to take 0.4 hours to complete. Time will vary depending upon the needs of the individual case. Any 
comnnents on the amount of time you are required to complete this form should be sent to the Chief Information Officer, Patent and Trademark 
Office. Washington. DC 20231, DO NOT SEND FEES OR COMPLETED FORMS TO THIS ADDRESS. SEND TO: Assistant Commissioner for 
Patents. Washington, DC 20231. 



DOCKET NO. JAB- 143 0 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicants 



CONTRERAS et al . 



Serial No. 



Art Unit : 



Filed 



June 4, 2 0 01 



Examiner : 



For 



DRUG TARGETS IN CANDIDA ALBICANS 



Commissioner for Patents 
Washington, D.C. 20231 



ASSOCIATE POWER OF ATTORNEY 



Sir : 



In the matter of the above -identified application, I hereby 
appoint Myra H. McCormack (Reg. No. 36,602), whose postal address 
is One Johnson & Johnson Plaza, New Brunswick, New Jersey 
08933-7003, my associate attorney to prosecute said application, 
to make alterations and amendments therein, to file continuing 
applications ^craimihg the" benefit " of said applicationT "to receive 
the patent and to transact all business in the Patent Office 
connected with said application. 

I request all communications with respect to said application 
be addressed to Philip S. Johnson, One Johnson & Johnson Plaza, 
New Brunswick, New Jersey 08933-7003. All telephone calls should 
be directed to Myra H. McCormack (732) 524-6932. 

Signed at New Brunswick, in the County of Middlesex and State 
of New Jersey, this 4th day of June, 2001. 




Reg. No. 34,087 

Attorney for Applicant (s) 



One Johnson & Johnson Plaza 
New Brunswick, NJ 08933-7003 
(732) 524-3742 
DATED: June 4, 2 0 01 
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SEQUENCE LISTING 

<110> Contreras, Roland 
Nelissen, Bart 
5 DeBacker, Marianne 

Luyten, Walter 
Viaene, Jasmine 
Logghe , Marc 

10 <120> Drug Targets in Candida Albicans 

<130> 53731/000 

<140> JAB 1430-US 
15 <141> 

<150> 982204122.0 
<151> 1998-12-04 

20 <160> 20 

<170> Patentin Ver. 2.0 

<210> 1 
25 <211> 438 
<212> DNA 

<213> Candida albicans 
<400> 1 

30 aacagctggt cttctgctaa tacattcaac cctttccata tctatactcc aacaatatga 
60 

taactgatga acaattgaat accattgcat tgacatttgg ttttgcttca ataatattaa 

. . 120__ . . _ _ . _ . . . . . ... 

tcataatata tcatgccata tctactaatg tacataaatt agaagatgaa accccatcat 
35 180 

cttcatttac cagaacaaat actactgaaa ctactgttgc aagtaagaaa aagaagtaat 
240 

aactgatgga tttttcttcc taccaccaat tgaataatgc tagacttgtt ggtgtgctac 
300 

40 aaatatttca aaagaaaata cgaatacttt ataaaatggt aagaacggaa gatggtttct 
360 

catttataca ctaaatacaa atcacataca catacacaaa cacaaataca tacatacacc 
420 

tatatccctt tatttgat 
45 438 

<210> 2 

<211> 1380 , 
<212> DNA 
50 <213> Candida albicans 

<400> 2 

atgttaaaaa cactaactca aactttacgc ttaactggga aagctttccc aaaggtccgt 
60 

55 ccggccttga tcagaaccta cgctgccttc gaccgttcta aacctcatgt caacattggt 
120 

actattggtc atgttgatca tggtaaaact acattgactg ctgctatcac caaagtttta 
180 

gccgaacaag gtggtgccaa cttcttggat tatggttcta ttgatagagc tccagaagaa 
60 240 

agagcfcagag gtatcactat ttccactgcc cacgttgaat acgaaaccaa gaacagacac 
300 
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tatgcccacg ttgattgtcc aggacacgct gattatatca aaaatatgat tactggtgcc 
360 

gctcaaatgg atggtgctat cattgttgtt gctgccactg atggtcaaat gcctcaaacc 
420 

5 agagaacatt tgttattggc cagacaagtt ggtgttcaag acttggttgt gtttgtcaac 
480 

aaagtcgata ctattgatga ccctgaaatg ttggaattag tcgaaatgga aatgagagaa 
540 

ttgttatcca cctacggttt tgatggtgac aacactccag ttattatggg atctgcttta 
10 600 

atggctttgg aagacaagaa accagaaatt ggtaaggaag ctatcttgaa attgttagat 
660 

gctgtcgatg aacacattcc aactccatca agagacttgg aacaaccatt tttgttacca 
720 

15 gttgaagacg tgttctccat ctccggtaga ggaactgttg tcactggtag agttgaaaga 
780 

ggtgttttga agaagggtga agaaatcgaa attgttggtg gttttgacaa accttacaag 
840 

actactgtta ccggtattga aatgttcaaa aaagaattag actctgctat ggctggtgac 
20 900 

aactgtggtg ttttgttaag aggtgttaaa agagatgaaa tcaagagagg tatggttttg 
960 

gccaaaccag gtactgctac ttctcacaag aagttcttgg cttccttgta tattttgact 
1020 

25 tccgaagaag gtggtcgttc cactccattt ggtgaaggtt acaagcctca atgcttcttc 
1080 

agaactaacg atgtcactac cacattttca ttcccagaag gagaaggtgt tgatcattct 
1140 

caaatgatca tgccaggtga caacattgaa atggttggtg aattgatcaa atcttgtcca 
30 1200 

ttagaagtca accaacgttt caacttgaga gaaggtggta aaactgttgg tactggtttg 
1260 

attaccagaa itcatcgaata-aacagaatgt. gcactgtgaa - taataaaaag aaaagaggta - . . 

1320 

35 tatataggtg actttgtatt ttgtattgaa caataaaatt ctgtaaatag taagggcctc 
1380 

<210> 3 
<211> 2283 
40 <212> DNA 

<213> Candida albicans 

<400> 3 

gaattcgccc ttaagcactc gtttcaacta tacattcagt aacaacaccc ttaatttacc 
45 60 

aaactacatt aatggaagta acacaacgga cgcagagtca gacacaacca acacaacagt 
120 

caccgacaac tcagacgcaa acccaaagca aagaggacca gaataggatt tgtcaattga 
180 

50 tttgctccac gggtcagttt ggcaattatg atttgaatat caacgataaa actatcgtac 
240 

aaggtaaaat gacgtggtat tttggaagag accccaactc agatttgcaa gtggcgtcgt 
3 00 

cgtcgagaat ttcaaacaag cattttcaaa tctggctcaa cttcaatgat aaatcactat 
55 360 

ggataaagga cacttcaact aacgggacac accttaacaa cagtcgattg gtgaaaggat 
420 

caaactacct tcttaatcag ggtgatgaaa tagcagtagg ggttggtaga gacgaggacg 
480 

60 ttgtgaggtt tgtcgttgtc tttggtgaca aatacaaccc ggcaaagcta cctgattcga 
540 
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ccaacacaat taaagatgaa ggaatataca aagactttat tgtgaaaaat gaaacgatag 
600 

gccaaggagc atttgccact gtgaaaaagg cgattgaacg atctacgggc gagtcgtacg 
660 

5 cggtgaagat tataaatcga agaaaagcat taaataccgg tggtggaagt gccatggcag 
720 

g^gtggaccg tgaattgtcc atattagagc ggctcaacca cccaaatata gttgctctaa 
780 

aagcttttta tgaagatatg gacaattact atattgtgat ggaattggtg ccgggcggtg 
10 840 

atttgatgga ctttgtggct gcaaacggtg caataggaga agacgcaaca caagtgatca 
900 

cgaaacagat tctagaagga attgcctatg ttcataattt aggaatctcc catcgtgatt 
960 

15 tgaagccaga taatattttg attatgcaag atgacccaat acttgttaaa atcaccgact 
1020 

ttggattggc aaaattcagt gacaatctga cgtttatgaa aactttttgt ggtacattgg 
1080 

. cgtatgttgc tcccgaagtt atcaccggta agtatggatc atcgcagatg gaactgcaac 

20 1140 

aaaaggacaa ctactcttcc ttggttgaca tttggtcttt gggatgtttg gtttatgtac 
1200 

ttttaacttc tcatttacca ttcaacggga aaaaccagca acaaatgttt gccaagatca 
1260 

25 aaaggggcga atttcatgag gctccattaa attcatacga catttctgaa gacggaagag 
1320 

atttcttgca gtgctgccta caggttaatc ctaaactaag gatgacggct gctgaagctt 
1380 

tgaaacataa atggttgcaa gacttgtatg aagaggattc tgtcaaatca ttgagtttat 
30 1440 

cgcaatcaca gtcgcaacaa tctcgaaaga tagataatgg tatccatatc gaatcattga 
1500 

. . . gcaaaattga. tgaagacgtt- atgcttcgtc cattggatag cgaaagaaat aggaaatcaa 
1560 

35 gtaaacagca agatttcaag gtacccaagc gtgtga.ttcc gttatctcaa catcctgcaa 
1620 

caccgttacc aatgtcacaa ccgaaaaaga ggccgtatca aatagaccct agaacaaaca 
1680 

aaaaagtcga tttggaagaa cctctgacaa gcaagaaagt caagctaagt gattccgttg 
40 1740 

ttgcggaaga ctacttgaag ttggggccac ttgcaaattc gttattccaa gaaacaataa 
1800 

atatttcaaa gtccccgttt tctttcggaa gaaatgacac ttgtgattgc gagatagacg 
1860 

45 acgacagact atccaaactt cattgtgtca ttaccaaaga aaacgactct atatggttat 
1920 

tggataagag tactaactcg tgcttggtca acaatactag tgttggaaaa ggcaacaaag 
1980 

ttttgcttag aggaggggag atattacatc tcttctttga cccattgtca ctgcaacata 
50 2040 

taggtttcaa agtagtcctt gttgatcaac tgtctggtga acataagagt caagtggagg 
2100 

ttttgaaaca aacctcagaa gaaatgaata ttattccact tatttctggt ttaagtagta 
2160 

55 taagttcata gatttagcat atatacaagc atttcctata gaaacaaagg ttcattaatt 
2220 

tagttattta cctccatgca attacattta cttcttcttc caagggcgaa ttctgcagat 
2280 
ate 
60 2283 

<210> 4 
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<211> 826 
<212> DNA 

<213> Candida albicans 
<400> 4 

atgggtagta tgtgaagata caatattgaa agtgtttact agaatatcta agatgtttga 
60 

gcccatggac atttttggat ttgataatta aaaaaagtag caatagatta ttgcgttgga 
120 

gaaagaatca ccatagttgc aagatttgat agatgttaaa atgttcacgc aggcgaaaga 
180 

tgtaacatct cttaaagtaa gaagaatatg gacatgaata aaaatagata gcactatttt 
240 

ggaacttgtt gaagatatta aaatagaatg ggatttcaac atagatattc aaagtaacga 
300 

aacctcacaa tcaaataaaa acaacagtaa tactaacaat tcaattttta tttttataga 
360 

gggtactcca tctttaggta aacgtcacaa caaatctcac accttatgta acagatgtgg 
420 

ccgtcgttca ttccacgtcc aaaagaagac ctgttcttct tgtggttacc cagctgctaa 
480 

aatgagatct cacaactggg ccttaaaagc caaaagaaga agaactactg gtaccggtag 
,540 

aatggcttac ttgaaacacg ttaccagaag attcaagaac ggtttccaaa ctggtgttgc 
600 

taaagctcaa accccttccg cttaaactaa ttactgaagt tattggtcat gcattagtca 
660 

ttattcatta aagtcatgtt aagcatagca aaggaagaat tggttagatt cttgtttaaa 
720 

atgtaatgac tatttaatat ctgtttaaat aagaggttta gtctttattt ttttacgtat 
780 

acaccaaaaa aaaaagaaac aaataaaatc tgtatattaa tgttgg 

_ 826 _ . ... . _. 

<210> 5 
<211> 978 
<212> DNA 

<213> Candida albicans 
<400> 5 

atgggtacta gtacaagtga agcattgaag aacatcaaaa acaaacagcg aagacagaaa 
60 

gtttttgcag aaataaaaca tgaaaagaat aaacaacgtc ataagcaaag agccgaaaga 
120 

gctaaggaag aaagagaaaa ccccgaatta agagaggaaa gaatagcagc taatatccca 
180 

gatactatag atagcaaacg tatttatgat gagactatag ctgctgaagt tgaaggagat 
240 

gacgagtttc agtcatattt cactaacttg ttggaagaac caaagatttt gttgacaaca 
300 

agtgccaatg ctaaaaaacc ggcctatgaa tttgcagaca tgatcatgga ctttttaccg 
360 

aatgtgacat ttatcaaaag gaagaaggaa tatacaatgc aagatatggc caaatattgc 
420 

tcgaatagag acttcactgc attgcttgtc atcaacgaag acaagaagaa ggtcaatggt 
480 

ataacgctca tcaatttacc tgaagggcca acattttatt tttcgattac atcaatagtt 
540 

gatgggaaaa gaattaaggg acacgggaaa gctggtgatt atttacctga gattgtattg 
600 

aataatttca attcaagatt gggtaaaact gtgggaagac tatttcaaag tattttccct 
660 
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cataaacctg aacttcaagg aagacaagtg attactttgc acaatcaacg tgattatatt 
720 

tttttcagaa gacatagata tattttcaga aatgaggaaa aggttggatt gcaggaattg 
780 

ggtccgcagt ttacattaaa gctaagaaga atgcaaaagg gagtacgtgg tgatgttgtt 
840 

tgggaacaca gaccagatat ggaaagagat aagaagaagt tttatttata agcgggtgta 
900 

taaaggtagt agtagtgcgt ttataagtat gtgtgtgtgt ttatgcatag atgtgtaaag 
960 

agtaatacag ctaattcg 
978 

<210> 6 
<211> 619 
<212> DNA 

<213> Candida albicans 
<400> 6 

aactaatttg tttaaacatc aataccaaga agtttttaca attcaatccc acatacacca 
60 

ttaattatga attctgaaaa gattattgaa gttatcattg ctattttctt accaccagta 
120 

gctgtgttta tgaaatgtgg tgccactacc ccattatgga ttaacttggt attatgtatc 
180 

tttatttggt tccctgctat cttacatgcc ttatacgttg tgttgaaaga ttaaacaaac 
240 

accagagatt tactgcttga tgaattgatt actccaaaga gttgtgacta gttcccagtg 
300 

tgtttttttt gccttccaac tttcttttac atttttccat tactaccact gtcttccccc 
360 

ctattttgca gagttttcaa aatttatcca aaacatgtta gtcattaaac catattatta 

_420- - -- - - -- -- -- - - . - - - - 

taattattct tttttgtatt tttttccctt aaaacacgtt aatttattaa tcgtttcgtt 
480 

gtttggtatt ttattttttt gtatttatca attggaatat atatctatac atgaatttat 
540 

tatccattgt accaattgtt aaaacatttt gttagttttt tgttactagt ataaaannat 
600 

aataaaagtt tanttcaac 
619 

<210> 7 
<211> 2319 
<212> DNA 

<213> Candida albicans 
<400> 7 

atgacattag ggttcgataa attcataagc aaggtcagca ctcatagacg tcaatctgaa 
60 

ccatcaatct tggaaatcgc agccaccaat tctcaaaata aatcgagaag gctaagtatg 
120 

gataatggtc attgttatgt tcgtgaatca actaataatc atcatcattt aaataccgtc 
180 

gttgataatt tacgacagcg tgcgggatcg ttttcattta tttcacatca ccataatcac 
240 

catcagaata gtcacgataa ttatactgtc gatcccctta catcaaacgg agcacgaatt 
300 

tcccgatcac gttcacgttc caaatcagtt gggcacggag aagcaatatc accagcgtat 
360 

ttttccaaga ataaaaccaa agatttagtg aaacaggaaa cagcacatat cattctgaag 
420 



aaattactca 
480 

tcacaaggtt 
540 

tacttaccag 
600 

ataattgctg 
660 

atggatagta 
720 

ttttctaaaa 
780 

tgtccaattc 
840 

aaagatcttc 
900 

tataatcgga 
960 

ttaagcgacg 
1020 

ttgggtcctg 
1080 

ccaaacaatt 
1140 

ccattatcaa 
1200 

ggttcagatt 
1260 

catattcctg 
1320 

tgcaataaat 
1380 

cctatggtcc 
1440 

aataggattt 
1500 

ttgggttgtg 
1560 

aagcgtatta 
1620 

gaatatgatt 
1680 

gtacgtgaac 
1740 

ggacatcttg 
1800 

tcttgttatg 
1860 

aacaagaacg 
1920 

aatgtttctt 
1980 

gaagaagaag 
2040 

gataataata 
2100 

actaatggtg 
2160 

cgacatattg 
2220 

gataataaaa 
2280 



acatgttaca 
cagaatcaaa 
cagcaagttc 
aagatagaaa 
taaatcattc 
tgaaatcatt 
cacatacatt 
atttccaatt 
ctcatgccaa 
ccgactatta 
aagatcttat 
catcaaacag 
cgtcaccaac 
catataaagc 
cttccattgt 
atactgataa 
gtactccacc 
ggaatgatgc 
aacacatgat 
aatttaatgt 
atgatagtga 
gtgttgtgtc 
aagcttataa 
aggttgaaaa 
ttaaacaaaa 
taccattttt 
gttcagatct 
ctaccccaag 
ctagtataaa 
aaattaaaca 
tgcatcatta 
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agatttggat 
gttttgtaaa 
aacaagtttc 
tgatgaaatg 
agagactgat 
ccattcacca 
tgctgtgatt 
tcagtcatta 
ggagaaattt 
tatcaatagt 
taatcgaact 
tttactggat 
atcaagttca 
tggtctttat 
ttctattaat 
gttgaatcgg 
aaacattggt 
cgtacattat 
aaatgtgaaa 
attggagaga 
agacccctat 
gttatatgaa 
acaagaagtt 
tgataataat 
gaataaagat 
aactactatg 
gcctcataca 
taacaataac 
tgaaattggt 
tcgattacaa 
tgaagtggtt 



ttacaaaacc 
atctacgtgt 
acttatgaag 
ccaacagcag 
ttcctggatt 
aattacttga 
gttgaattaa 
actaccattt 
accattggga 
tctaattcca 
agagaataca 
gatgacatgg 
acttcaacaa 
gtatttttat 
ggttcattgg 
aaatcaaaag 
aattccattg 
attataactt 
ttactgccca 
ataacttatg 
tgtattcatc 
ttgaaaacga 
atgaaatgtc 
aacggcaacg 
caaccaatga 
tctgatagtt 
tcaagaagag 
cctttatctc 
gatcatacat 
gttacattta 
attgataccc 



ctattgcatt 
ctaacactaa 
atgatgaaaa 
ttaataacaa 
ctccaccacc 
cttcaaaaat 
ccaaggactc 
tatggccaac 
atatggaatg 
acgatgttaa 
aattaatcga 
atattaataa 
attcaacctc 
taccaatctt 
ctcatacatt 
tatcagcatc 
ctgataagcc- 
tcccccgcaa 
tggtgaaaga 
tttccaaaaa 
cagtttctaa 
aggcaaaaca 
cggaaaataa 
gcaacggcaa 
ttgctacacc 
taattatgac 
ggtcggcagt 
catttttggg 
tattccctga 
ggatttctaa 
ccatcgtttt 
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gaaaacaata 

taattgtatt 

tggcggcgtt 

tactttgtca 

tccagattta 

cgattctgaa 

tttgattatt 

tggggatgca 

gcgtacatct 

gctgaaaaac 

tattgaagaa 

tattacgtcg 

caactcattg 

attgccagaa 

actggttgaa 

gtacaattta 

aatttatgtt . . 

atatgttact 

tgtggttatc 

tttatcacga 

agaaaataaa 

atcttctggt 

ccttttattt 

cggcaacgga 

tttagatatc 

atcagccata 

gagtatgact 

agcagtggaa 

ttctaatttt 

accggatctg 

acttagttca 
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aaatgtcaag aagattctcc tcctccttat agttctgta 
2319 

<210> 8 
<211> 255 
<212> DNA 

<213> Candida albicans 
<400> 8 

aacgttcgtg caaaaggcta tactggtgat atccacgcag atgaagagca agtttaatca 
60 

actctttgtc aattaatgct gtacttgttt tcattttatt tgctggcatt taaagaatac 
120 

ccatagttca gaaaataaaa ttgaaaaatt taaaaaaaaa cgcaatatca ttcatttttt 
180 

ttgttttttt gacaataata ttaatatgta gttaccaatg tttttagatt ttatatgttt 
240 

tgaaaaaata gtttg 
255 

<210> 9 
<211> 119 
<212> DNA 

<213> Candida albicans 
<400> 9 

aaccttacaa tcattatacc aactatcaaa atcataagac tcttnaactt ctgtttttga 
60 

tagttggtat aatgatttat gtattatctt aattcattat tattagtttc ggtcacaaa 
119 

<210> 10 

-<211> -6 0-- - - - - . - . -- - . 

<212> PRT 

<213> Candida albicans 
<400> 10 

Met lie Thr Asp Glu Gin Leu Asn Thr lie Ala Leu Thr Phe Gly Phe 
1 5 10 15 

Ala Ser lie lie Leu lie lie lie Tyr His Ala lie Ser Thr Asn Val 
20 25 30 

His Lys Leu Glu Asp Glu Thr Pro Ser Ser Ser Phe Thr Arg Thr Asn 
35 40 45 

Thr Thr Glu Thr Thr Val Ala Ser Lys Lys Lys Lys 
50 55 60 



<210> 11 
<211> 426 
<212> PRT , 

<213> Candida albicans 
<400> 11 

Met Leu Lys Thr Leu Thr Gin Thr Leu Arg Leu Thr Gly Lys Ala Phe 
15 10 15 



Pro Lys Val Arg Pro Ala Leu lie Arg Thr Tyr Ala Ala Phe Asp Arg 
20 25 30 
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Ser Lys Pro His Val Asn lie Gly Thr lie Gly His Val Asp His Gly 
35 40 45 

Lys Thr Thr Leu Thr Ala Ala lie Thr Lys Val Leu Ala Glu Gin Gly 
5 50 55 60 

Gly Ala Asn Phe Leu Asp Tyr Gly Ser lie Asp Arg Ala Pro Glu Glu 
65 70 75 80 

10 Arg Ala Arg Gly lie Thr lie Ser Thr Ala His Val Glu Tyr Glu Thr 

85 90 95 

Lys Asn Arg His Tyr Ala His Val Asp Cys Pro Gly His Ala Asp Tyr 
100 105 110 



15 



30 



45 



60 



He Lys Asn Met He Thr Gly Ala Ala Gin Met Asp Gly Ala He He 
115 120 125 



Val Val Ala Ala Thr Asp Gly Gin Met Pro Gin Thr Arg Glu His Leu 
20 130 135 140 

Leu Leu Ala Arg Gin Val Gly Val Gin Asp Leu Val Val Phe Val Asn 

145 150 155 160 

25 Lys Val Asp Thr He Asp Asp Pro Glu Met Leu Glu Leu Val Glu Met 

165 170 175 



Glu Met Arg Glu Leu Leu Ser Thr Tyr Gly Phe Asp Gly Asp Asn Thr 
180 185 190 

Pro Val He Met Gly Ser Ala Leu Met Ala Leu Glu Asp Lys Lys Pro 
195 200 205 



Glu He Gly Lys Glu Ala He Leu Lys Leu Leu Asp Ala Val Asp Glu 
35 210 215 220 

His He Pro Thr Pro Ser Arg Asp Leu Glu Gin Pro Phe Leu Leu Pro 
225 230 235 240 

40 Val Glu Asp Val Phe Ser He Ser Gly Arg Gly Thr Val Val Thr Gly 

245 250 255 

Arg Val Glu Arg Gly Val Leu Lys Lys Gly Glu Glu He Glu He Val 
260 265 270 



Gly Gly Phe Asp Lys Pro Tyr Lys Thr Thr Val Thr Gly He Glu Met 
275 280 285 



Phe Lys Lys Glu Leu Asp Ser Ala Met Ala Gly Asp Asn Cys Gly Val 
50 290 295 300 

Leu Leu Arg Gly Val Lys Arg Asp Glu He Lys Arg Gly Met Val Leu 
305 310 315 320 

55 Ala Lys Pro Gly Thr Ala Thr Ser His Lys Lys Phe Leu Ala Ser Leu 

325 330 335 



Tyr He Leu Thr Ser Glu Glu Gly Gly Arg Ser Thr Pro Phe Gly Glu 
340 345 350 

Gly Tyr Lys Pro Gin Cys Phe Phe Arg Thr Asn Asp Val Thr Thr Thr 
355 360 365 
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Phe Ser Phe Pro Glu Gly Glu Gly 
370 375 

Pro Gly Asp Asn lie Glu Met Val 
385 390 

Leu Glu Val Asn Gin Arg Phe Asn 
405 

Gly Thr Gly Leu lie Thr Arg lie 
420 



Val Asp His Ser Gin Met lie Met 
380 

Gly Glu Leu lie Lys Ser Cys Pro 
395 400 

Leu Arg .Glu Gly Gly Lys Thr Val 
410 415 

lie Glu 
425 



<210> 12 
<211> 699 
<212> PRT 

<213> Candida albicans 
<400> 12 

Met Glu Val Thr Gin Arg Thr Gin Ser Gin Thr Gin Pro Thr Gin Gin 
15 10 15 

Ser Pro Thr Thr Gin Thr Gin Thr Gin Ser Lys Glu Asp Gin Asn Arg 
20 25 30 

lie Cys Gin Leu lie Cys Ser Thr Gly Gin Phe Gly Asn Tyr Asp Leu 
35 40 45 

Asn lie Asn Asp Lys Thr lie Val Gin Gly Lys Met Thr Trp Tyr Phe 
50 55 60 

Gly Arg Asp . Pro_.Asn Ser . Asp . Leu Gin Val Ala Ser .Ser Ser _Arg lie _ 
65 70 75 80 

Ser Asn Lys His Phe Gin lie Trp Leu Asn Phe Asn Asp Lys Ser Leu 
85 90 95 

Trp lie Lys Asp Thr Ser Thr Asn Gly Thr His Leu Asn Asn Ser Arg 
100 105 110 

Leu Val Lys Gly Ser Asn Tyr Leu Leu Asn Gin Gly Asp Glu lie Ala 
115 120 125 

Val Gly Val Gly Arg Asp Glu Asp Val Val Arg Phe Val Val Val Phe 
130 135 140 

Gly Asp Lys Tyr Asn Pro Ala Lys Leu Pro Asp Ser Thr Asn Thr lie 
145 150 155 160 

Lys Asp Glu Gly lie Tyr Lys Asp Phe lie Val Lys Asn Glu Thr lie 
165 170 175 

Gly Gin Gly Ala Phe Ala Thr Val Lys Lys Ala lie Glu Arg Ser Thr 
180 185 190 

Gly Glu Ser Tyr Ala Val Lys lie lie Asn Arg Arg Lys Ala Leu Asn 
195 200 205 

Thr Gly Gly Gly Ser Ala Met Ala Gly Val Asp Arg Glu Leu Ser lie 
210 215 220 



Leu Glu Arg Leu Asn His Pro Asn He Val Ala Leu Lys Ala Phe Tyr 
225 230 235 240 

Glu Asp Met Asp Asn Tyr Tyr He Val Met Glu Leu Val Pro Gly Gly 
245 250 255 

Asp Leu Met Asp Phe Val Ala Ala Asn Gly Ala He Gly Glu Asp Ala 
260 265 270 

Thr Gin Val He Thr Lys Gin He Leu Glu Gly He Ala Tyr Val His 
275 280 285 

Asn Leu Gly He Ser His Arg Asp Leu Lys Pro Asp Asn He Leu He 
290 295 300 

Met Gin Asp Asp Pro He Leu Val Lys He Thr Asp Phe Gly Leu Ala 
305 310 315 320 

Lys Phe Ser Asp Asn Ser Thr Phe Met Lys Thr Phe Cys Gly Thr Leu 
325 330 335 

Ala Tyr Val Ala Pro Glu Val He Thr Gly Lys Tyr Gly Ser Ser Gin 
340 345 350 

Met Glu Ser Gin Gin Lys Asp Asn Tyr Ser Ser Leu Val Asp He Trp 
355 360 365 

Ser Leu Gly Cys Leu Val Tyr Val Leu Leu Thr Ser His Leu Pro Phe 
370 375 380 

Asn Gly Lys Asn Gin Gin Gin Met Phe Ala Lys He Lys Arg Gly Glu 
385 390 395 400 

Phe His Glu Ala Pro Leu Asn Ser Tyr Asp He Ser Glu Asp Gly Arg 
405 410 415 

Asp Phe Leu Gin Cys Cys Leu Gin Val Asn Pro Lys Leu Arg Met Thr 
420 425 430 

Ala Ala Glu Ala Leu Lys His Lys Trp Leu Gin Asp Leu Tyr Glu Glu 
435 440 445 

Asp Ser Val Lys Ser Leu Ser Leu Ser Gin Ser Gin Ser Gin Gin Ser 
450 455 460 

Arg Lys He Asp Asn Gly He His He Glu Ser Leu Ser Lys He Asp 
465 470 475 480 

Glu Asp Val Met Leu Arg Pro Leu Asp Ser Glu Arg Asn Arg Lys Ser 
485 490 495 

Ser Lys Gin Gin Asp Phe Lys Val Pro Lys Arg Val He Pro Leu Ser 
500 505 510 

Gin His Pro Ala Thr Pro Leu Pro Met Ser Gin Pro Lys Lys Arg Pro 
515 520 525 

Tyr Gin He Asp Pro Arg Thr Asn Lys Lys Val Asp Leu Glu Glu Pro 
530 535 540 

Ser Thr Ser Lys Lys Val Lys Leu Ser Asp Ser Val Val Ala Glu Asp 
545 550 555 560 



Tyr Leu Lys Leu 



Asn lie Ser Lys 
580 

Cys Glu lie Asp 
595 

Lys Glu Asn Asp 
610 

Leu Val Asn Asn 
625 

Gly Gly Glu lie 



lie Gly Phe Lys 
660 

Ser Gin Val Glu 
675 

Pro Leu lie Ser 
690 



Gly Pro Leu Ala 
565 

Ser Pro Phe Ser 



Asp Asp Arg Leu 
600 

Ser lie Trp Leu 
615 

Thr Ser Val Gly 
630 

Leu His Leu Phe 
645 

Val Val Leu Val 



Val Leu Lys Gin 
680 

Gly Leu Ser Ser 
695 



Asn Ser Leu Phe 
570 

Phe Gly Arg Asn 
585 

Ser Lys Leu His 



Leu Asp Lys Ser 
620 

Lys Gly Asn Lys 
635 

Phe Asp Pro Leu 
650 

Asp Gin Ser Ser 
665 

Thr Ser Glu Glu 



He Ser Ser 



Gin Glu Thr He 
575 

Asp Thr Cys Asp 
590 

Cys Val He Thr 
605 

Thr Asn Ser Cys 



Val Leu Leu Arg 
640 

Ser Ser Gin His 
655 

Gly Glu His Lys 
670 

Met Asn He He 
685 



<210> 13 
<211> 295 
<212> PRT 

<213>- Candida albicans - - - _ . . . _ 

<400> 13 

Met Gly Thr Ser Thr Ser Glu Ala Leu Lys Asn He Lys Asn Lys Gin 
1 5. 10 15 

Arg Arg Gin Lys Val Phe Ala Glu He Lys His Glu Lys Asn Lys Gin 
20 25 30 

Arg His Lys Gin Arg Ala Glu Arg Ala Lys Glu Glu Arg Glu Asn Pro 
35 40 ^ 45 

Glu Leu Arg Glu Glu Arg He Ala Ala Asn He Pro Asp Thr He Asp. 
50 55 60 

Ser Lys Arg He Tyr Asp Glu Thr He Ala Ala Glu Val Glu Gly Asp 
65 70 75 80 

Asp Glu Phe Gin Ser Tyr Phe Thr Asn Leu Leu Glu Glu Pro Lys He 
85 90 95 

Leu Leu Thr Thr Ser Ala Asn Ala Lys Lys Pro Ala Tyr Glu Phe Ala 
100 105 110 

Asp Met He Met Asp Phe Leu Pro Asn Val Thr Phe He Lys Arg Lys 
115 120 125 



Lys Glu Tyr Thr Met Gin Asp Met Ala Lys Tyr Cys Ser Asn Arg Asp 
130 135 140 
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Phe Thr Ala Leu 
145 

lie Thr Leu lie 



Thr Ser lie Val 
180 

Asp Tyr Leu Pro 
195 

Lys Thr Val Gly 
210 

Leu Gin Gly Arg 
225 

Phe Phe Arg Arg 



Leu Gin Glu Gly 

260 

Gly Val Arg Gly 
275 

Asp Lys Lys Lys 
290 



Leu Val lie Asn 
150 

Asn Leu Pro Glu 
165 

Asp Gly Lys Arg 



Glu lie Val Leu 
200 

Arg Leu Phe Gin 
215 

Gin Val lie Thr 
230 

His Arg Tyr lie 
245 

Pro Gin Phe Thr 



Asp Val Val Trp 
280 

Phe Tyr Leu 
295 



Glu Asp Lys Lys 
155 

Gly Pro Thr Phe 
170 

lie Lys Gly His 
185 

Asn Asn Phe Asn 



Ser lie Phe Pro 
220 

Leu His Asn Gin 
235 

Phe Arg Asn Glu 
250 

Leu Lys Leu Arg 
265 

Glu His Arg Pro 



Lys Val Asn Gly 
160 

Tyr Phe Ser lie 
175 

Gly Lys Ala Gly 
190 

Ser Arg Leu Gly 
205 

His Lys Pro Glu 



Arg Asp Tyr lie 
240 

Glu Lys Val Gly 
255 

Arg Met Gin Lys 
270 

Asp Met Glu Arg 
285 



<210> 14 

<211> 55 - - - -- - - --- - 

<212> PRT 

<213> Candida albicans 
<400> 14 

Met Asn Ser Glu Lys lie lie Glu Val lie lie Ala lie Phe Leu Pro 
15 10 15 

Pro Val Ala Val Phe Met Lys Cys Gly Ala Thr Thr Pro Leu Trp lie 
20 25 30 

Asn Leu Val Leu Cys lie Phe lie Trp Phe Pro Ala lie Leu His Ala 
35 40 45 

Leu Tyr Val Val Leu Lys Asp 
50 55 



<210> 15 
<211> 773 
<212> PRT 

<213> Candida albicans 
<400> 15 

Met Thr Leu Gly Phe Asp Lys Phe lie Ser Lys Val Ser Thr His Arg 
15 10 15 

Arg Gin Ser Glu Pro Ser lie Leu Glu lie Ala Ala Thr Asn Ser Gin 
20 25 30 
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Asn Lys Ser Arg Arg Leu Ser Met Asp Asn Gly His Cys Tyr Val Arg 
35 40 45 



Glu Ser Thr Asn 
50 

Arg Gin Arg Ala 
65 

His Gin Asn Ser 



Gly Ala Arg lie 
100 

Gly Glu Ala lie 
115 

Leu Val Lys Gin 
130 

Met Leu Gin Asp 
145 

Ser Gin Gly Ser 



Asn Asn Cys lie 
180 

Glu Asp Asp Glu 
195 

Glu Met Pro Thr 
210 

Asn His Ser Glu 
225 

Phe Ser Lys Met 



lie Asp Ser Glu 
260 

Leu Thr Lys Asp 
275 

Ser Leu Thr Thr 
290 

His Ala Lys Glu 
305 

Leu Ser Asp Ala 



Lys Ser Lys Asn 
340 

Tyr Lys Leu lie 
355 



Asn His His His 
55 

Gly Ser Phe Ser 
70 

His Asp Asn Tyr 
85 

Ser Arg Ser Arg 



Ser Pro Ala Tyr 
120 

Glu Thr Ala His 
135 

Leu Asp Leu Gin 
150 

Glu Ser Lys Phe 
165 

Tyr Leu Pro Ala 



Asn Gly Gly Val 
200 

Ala Val Asn Asn 
215 

Thr Asp Phe Ser 
230 

Lys Ser Phe His 
245 

Cys Pro lie Pro 



Ser Leu lie lie 
280 

lie Leu Trp Pro 
295 

Lys Phe Thr lie 
• 310 

Asp Tyr Tyr lie 
325 

Leu Gly Pro Glu 



Asp lie Glu Glu 
360 



Leu Asn Thr Val 
60 

Phe lie Ser His 
75 

Thr Val Asp Pro 
90 

Ser Arg Ser Lys 
105 

Phe Ser Lys Asn 



lie lie Ser Lys 
140 

Asn Pro lie Ala 
155 

Cys Lys lie Tyr 
170 

Ala Ser Ser Thr 
185 

lie lie Ala Glu 



Asn Thr Leu Ser 
220 

Asp Ser Pro Pro 
235 

Ser Pro Asn Tyr 
250 

His Thr Phe Ala 
265 

Lys Asp Leu His 



Thr Gly Asp Ala 
300 

Gly Asn Met Glu 
315 

Asn Ser Ser Asn 
330 

Asp Leu lie Asn 
345 

Pro Asn Asn Ser 



Val Asp Asn Leu 



His His Asn His 
80 

Leu Thr Ser Asn 
95 

Ser Val Gly His 
110 

Lys Thr Lys Asp 
125 

Lys Leu Leu Asn 



Leu Lys Thr lie 
160 

Val Ser Asn Thr 
175 

Ser Phe Thr Tyr 
190 

Asp Arg Asn Asp 
205 

Met Asp Ser lie 



Pro Pro Asp Leu 
240 

Leu Thr Ser Lys 
255 

Val lie Val Glu 
270 

Phe Gin Phe Gin 

285 

Tyr Asn Arg Thr 



Trp Arg Thr Ser 
320 

Ser Asn Asp Val 
335 

Arg Thr Arg Glu 
350 

Ser Asn Ser Leu 
365 
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Ser Asp Asp Asp Met Asp lie Asn Asn. lie Thr Ser Pro Leu Ser Thr 
370 375 380 

Ser Pro Thr Ser Ser Ser Thr Ser Thr Asn Ser Thr Ser Asn Ser Leu 
385 390 395 400 

Gly Ser Asp Ser Tyr Lys Ala Gly Leu Tyr Val Phe Leu Leu Pro lie 
405 410 415 

Leu Leu Pro Glu His lie Pro Ala Ser lie Val Ser lie Asn Gly Ser 
420 425 430 

Leu Ala His Thr Leu Ser Val Glu Cys Asn Lys Tyr Thr Asp Lys Leu 
435 440 445 

Asn Arg Lys Ser Lys Val Ser Ala Ser Tyr Asn Leu Pro Met Val Arg 
450 455 460 

Thr Pro Pro Asn lie Gly Asn Ser lie Ala Asp Lys Pro lie Tyr Val 
465 470 475 480 

Asn Arg lie Trp Asn Asp Ala Val His Tyr lie lie Thr Phe Pro Arg 
485 490 495 

Lys Tyr Val Thr Leu Gly Cys Glu His Met lie Asn Val Lys Leu Ser 
500 505 510 

Pro Met Val Lys Asp Val Val lie Lys Arg lie Lys Phe Asn Val Leu 
515 520 525 

Glu Arg lie Thr Tyr Val Ser Lys Asn Leu Ser Arg Glu Tyr Asp Tyr 

- 530 - 535 - . - - 540 - - - - 

Asp Ser Glu Asp Pro Tyr Cys lie His Pro Val Ser Lys Glu Asn Lys 
545 550 555 560 

Val Arg Glu Arg Val Val Ser Leu Tyr Glu Leu Lys Thr Lys Ala Lys 
565 570 575 

Gin Ser Ser Gly Gly His Leu Glu Ala Tyr Lys Gin Glu Val Met Lys 
580 585 590 

Cys Pro Glu Asn Asn Leu Leu Phe Ser Cys Tyr Glu Val Glu Asn Asp 
595 600 605 

Asn Asn Asn Gly Asn Gly Asn Gly Asn Gly Asn Gly Asn Lys Asn Val 
610 615 620 

Lys Gin Lys Asn Lys Asp Gin Pro Met lie Ala Thr Pro Leu Asp lie 
625 630 635 640 

Asn Val Ser Leu Pro Phe Leu Thr Thr Met Ser Asp Ser Leu lie Met 
645 650 655 

Thr Ser Ala lie Glu Glu Glu Gly Ser Asp Ser Pro His Thr Ser Arg 
660 665 670 

Arg Gly Ser Ala Val Ser Met Thr Asp Asn Asn Thr Thr Pro Ser Asn 
675 680 685 



Asn Asn Pro Leu Ser Pro Phe Leu Gly Ala Val Glu Thr Asn Gly Ala 



690 

Ser He Asn Glu He 
705 

Arg His He Glu He 
725 

Lys Pro Asp Ser Asp 
740 

Thr Pro He Val Leu 
755 

Pro Tyr Ser Ser Val 
770 
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695 

Gly Asp His Thr Leu Phe 
710 715 

Lys His Arg Leu Gin Val 
730 

Asn Lys Met His His Tyr 
745 

Leu Ser Ser Lys Cys Gin 
760 
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700 

Pro Asp Ser Asn Phe 
720 

Thr Phe Arg He Ser 
735 

Glu Val Val He Asp 
750 

Glu Asp Ser Pro Pro 
765 



<210> 16 
<211> 90 
<212> PRT 

<213> Candida albicans 
<400> 16 

Met Gly Glu Gly Thr Pro Ser Leu Gly Lys Arg His Asn Lys Ser His 
15 10 15 

Thr Leu Cys Asn Arg Cys Gly Arg Arg Ser Phe His Val Gin Lys Lys 
20 25 30 

Thr Cys Ser Ser Cys Gly Tyr Pro Ala Ala Lys Met Arg Ser His Asn 
35 40 45 



Trp Ala Leu Lys Ala Lys Arg Arg Arg Thr Thr Gly Thr Gly Arg Met 
50 55 60 

Ala Tyr Leu Lys His Val Thr Arg Arg Phe Lys Asn Gly Phe Gin Thr 
65 70 75 80 

Gly Val Ala Lys Ala Gin Thr Pro Ser Ala 
85 90 



<210> 17 
<211> 19 
<212> DNA 

<213> artificial sequence 
<220> 

<223> Description of Artificial Sequence : primer 



<400> 17 

tgcagctcga cctcgactg 19 



<210> 18 
<211> 19 
<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence : primer 



■C 
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10 



25 



<400> 18 

gcgtgaatgt aagcgtgac 19 



<210> 19 
<211> 20 
<212> DNA 

<213> artificial sequence 



<220> 

<223> Description of Artificial Sequence : primer 



<400> 19 
15 tgagcagctc gccgtcgcgc 20 



<210> 20 
<211> 22 
20 <212> DNA 

<213> artificial sequence 

<220> 

<223> Description of Artificial Sequence : primer 
<400> 20 

gagttatacc ctgcagctcg ac 22 



30 <210> 21 

<211> 6671 
<212> DNA 

<213> Artificial- Sequence - - 

35 <220> 

<223> Description of Artificial Sequence :DNA plasmid 
<400> 21 

ttccatcggg gaaagtgggg gggaaaaaat tttaagcagt tcacaaaacc ttccaaaaaa 
40 60 

tatatggaca aagatgattg tattttcccg acaccaaaat cataattaat tatgagaaag 
120 

ttaaatgtaa cgttacaatt tatgtttatt tgaaggtgaa aagcgattta tgatttttcc 
180 

45 gaaatgaaaa ttttttttag gtttattttt tttgtcgggc aaagaaaaac tgaacaagga 
240 

ttattaaaat ttttggtgtt tgtttgtgtc tggagaattc attcctctct catcttcaca 
300 

caatgtttag acatctgaca cgattcatga tagttcggtt tccggggttg gtgtttagtt 
50 360 

ttcgtttttc tttttttttg gaaagaatgt tttagctcat tggttttctt tcttcattca 
420 

atagttttga aagaatttgc ccacttgtta ttacaatcat ataaaattaa actttgatat 
480 

55 aaaatagagt ttgaaagttt cccagatcct ttttgatttc tttgtaaatt tttttttctc 
540 

ccacatatac acacatacaa accgattttt ataagaaaga gttataccct gcagctcgac 
600 

ctcgactgtt taaacctgca ggcatgcaag cttggccaaa aaggcctcga ggaacatgac 
60 660 

caacaagtgt ctcctccaaa ttgctctcct gttgtgcttc tccactacag ctctttccat 
720 
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gagctacaac ttgcttggat tcctacaaag aagcagcaat tttcagtgtc agaagctcct 
780 . 

gtggcaattg aatgggaggc ttgaatactg cctcaaggac aggatgaact ttgacatccc 
840 

5 tgaggagatt aagcagctgc agcagttcca gaaggaggac gccgcattga ccatctatga 
900 

gatgctccag aacatctttg ctattttcag acaagattca tctagcactg gctggaatga 
960 

gactattgtt gagaacctcc tggctaatgt ctatcatcag ataaaccatc tgaagacagt 
10 1020 

cctggaagaa aaactggaga aagaagattt caccagggga aaactcatga gcagtctgca 
1080 

cctgaaaaga tattatggga ggattctgca ttacctgaag gccaaggagt acagtcactg 
1140 

15 tgcctggacc atagtcagag tggaaatcct aaggaacttt tacttcatta acagacttac 
1200 

aggttacctc cgaaactgaa gatctcctag cctgtgcctc tgggactgga caattgcttc 
1260 

aagcattctt caaccagcag atgctgttta agtgactgat ggctaatgta ctgcatatga 
20 1320 

aaggacacta gaagattttg aaatttttat taaattatga gttattttta tttatttaaa 
13 80 

ttttattttg gaaaataaat tatttttggt gcaaaagtcc ctcgaggcct agcggccgcc 
1440 

25 tagaggatcc ccgggcgcta ggcggccgct aggccttttt ggccgaattc gagctcggta 
1500 

cccggggaga tccgtccccc ttttcctttg tcgatatcat gtaattagtt atgtcacgct 
1560 

tacattcacg ccctcccccc acatccgctc taaccgaaaa ggaaggagtt agacaacctg 
30 1620 

aagtctaggt ccctatttat ttttttatag ttatgttagt attaagaacg ttatttatat 
1680 

-ttcaaatttt tctttttttt . ctgtacagac- gcgtg.tacgc atgtaacatt atactgaaaa. . 

1740 

35 ccttgcttga gaaggttttg ggacgctcga aggctttaat ttgcaagcta gcttggcgta 
1800 

atcatggtca tagctgtttc ctgtgtgaaa ttgttatccg ctcacaattc cacacaacat 
1860 

acgagccgga agcataaagt gtaaagcctg gggtgcctaa tgagtgagct aactcacatt 
40 1920 

aattgcgttg cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc agagatctct 
1980 

gcattaatga atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc 
2040 

45 ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagatcg 
2100 

atctcactca aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa 
2160 

catgtgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt 
50 2220 

tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg 
2280 

gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg 
2340 

55 ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag 
2400 

cgtggcgctt tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc 
2460 

caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa 
60 2520 

ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg 
2580 
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taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc 
2640 

taactacggc tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac 
2700 

5 cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg 
2760 

tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt 
2820 

gatcttttct acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt 
10 2880 

catgagatta tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa 
2940 

atcaatctaa agaagtggat ctaggaaaat ttaattttta cttcaaaatt tagttagatt 
3000 

15 agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc 
3060 

tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact 
3120 

acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc 
20 3180 

tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt 
3240 

ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta 
3300 

25 agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg 
3360 

tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt 
3420 

acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc 
30 3480 

agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt 
3540 

actgtcatgc- catccgtaag atgcttttct gtgactggtg- agtactcaac caagtcattc - 
3600 

35 tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc 
3660 

gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa 
3720 

ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac 
40 3780 

tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa 
- 3840 

aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt 
3900 

45 tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa 
3960 

tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct 
4020 

gacgtctaag aaaccattat tatcatgaca ttaacctata aaaataggcg tatcacgagg 
50 4080 

ccctttcgtc tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg 
4140 

gagacggtca cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg 
4200 

55 tcagcgggtg ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta 
4260 

ctgagagtgc accatatcga cgctctccct tatgcgactc ctgcattagg aagcagccca 
4320 

gtagtaggtt gaggccgttg agcaccgccg ccgcaaggaa tggtgcatgc aaggagatgg 
60 4380 

cgcccaacag tcccccggcc acggggcctg ccaccatacc cacgccgaaa caagcactaa 
4440 



taggaattga tttggatggt ataaacggaa 
4500 

aaaattattt tattatttga ttttatttaa 
4560 

5 ttttgttgaa agttgctgta gtgccattga 
4620 

ctcttgtttg atagtatcca aaaaaacggc 
4680 

tatacagata acataatgaa agaaaaaatc 
10 4740 

ccattctttt aaacattgat caattcctga 
4800 

ccgccccttt tacagttgaa gaaagaaata 
4860 

15 agtcaacact aagacctata gtgagagagc 
4920 

attatttcga ttaatggaac tgaagaaaac 
4980 

cactaaggag ttcctcgagt taattgataa 
20 5040 

tcatattgat ataatcaatg atttttccta 
5100 

ttcacgtaaa catcaattta tgatttttga 
5160 

25 cgtaaagaaa caatatattg gtggagttta 
5220 

tgctcatggt gtcactggga atggagtggt 
5280 

caccaccaac caagagccaa gagggttatt 
30 5340 

attagcatat ggagaatatt ctcaaaaaac 
5400 

- -- tgttattgga- tttattgccc aacgtgatat 
5460 

35 tattatgaca cctggagttg gattagatga 
5520 

aactgttgat gaagttgtta gcactggaac 
5580 

tggtaaagga agagatccag atattgaagg 
40 5640 

ttatttgaaa aagactggcc aattataaat 
5700 

ttgtatatat gtagaataaa taaataaata 
5760 

45 attattacta tttacaatca aaggtggtcc 
5820 

acattcatca gtgtaaaaat ggaatcaata 
5880 

agcccgatct tccccatcgg tgatgtcggc 
50 5940 

gccgcagcgc gcagggtcag cctgaatacg 
6000 

aggtcagaat agcccaagtc ggccgagggg 
6060 

55 cgaagaggaa ccaatgtaac gttacactga 
6120 

tgtaaaagtg tgaaaataat ttttgaatat 
6180 

gtgttttttt tagagaatgg gaattcttat 
60 6240 

ctgtgcacaa aaacgtttgg atggatgatc 
6300 
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acaaaaaaaa 


gagctggtac 


tactttcttt 


tagtatatat 


tatattttga 


acgtagatta 


ttcgtaacac 


taattctgta 


ttagtcattc 


tatttttttg 


caatcttatt 


tcctgcatat 


tttttttttg 


ttcttcaatg 


atgatttcaa 


gcaacaaccc 


catacacact 


ggtttatata 


gaaatagaaa 


tagcaaacaa 


aagatatgac 


agaaactcat 


gcctcaccag 


tagcacagcg 


caatttatgt 


gcatcaattg 


acgttgatac 


attaggtcct 


tatgtatgct 


taatcaagac 


tgaatccact 


attgaaccat 


tattagaact 


agatagaaaa 


tttgctgata 


ttggtaatac 


taaaattagt 


agttgggcag 


atattaccaa 


tgaaggatta 


aaacagggag 


ctaaagaaac 


gatgttagct 


gaattatcat 


cagtgggatc 


tgttgaaatt 


gctaaatccg 


ataaggaatt 


gggtggccaa- 


gaagaaggat 


ttgattggct 


taaaggtgat 


ggattaggac 


aacaatatag 


tgatattatc 


attgttggta 


gaggattgtt 


taaaaggtat 


agaaatgctg 


gttggaatgc 


gtgaaggggg 


agattttcac 


tttattagat 


agttaaataa 


ataattaaat 


aagggtggta 


ttctagctgt 


aatccgggca 


gcgcaacgga 


aagccctgcg 


ctcatgagcc 


cgaagtggcg 


gatataggcg 


ccagcaaccg 


cacctgtggc 


cgtttaatga 


ccagcacagt 


cgtgatggca 






r't'aa tat tQa 


agaaaacaca 


caataaacgg 


gaagaaacgg 


catttccctt 


ggtttaattc 


caaacgaaac 


tggatgtcta 


gattgtttgt 


ttactccaga 


agaagatatt 


tttaggctta 


gctctaaata 
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taagaaatga tgcttgaaaa accagacaga aattgagttt caaaaattgg taatgtgagg 
6360 

tattagtcaa ctaaccaaat aacaatgcaa accggttgat acatttcatt ttgaaaataa 
6420 

5 tgaaactgga attggatgac cagcacacaa acacataaag taattatggg aattagaagc 
6480 

gaacatagag gagtacttgg ccacgaacag aatacaagtg ggaacactat tttctccatt 
6540 

gttttagttc tgtttttttg tcagcctagt tttgtgctat gtgtaaaaaa tattgccaag 
10 6600 

aaaaaaagct tgttttgtgg ccagtgtccg aaaaaaattt tggggaatct tcggattaat 
6660 

ttatgttttc a 
6671 

15 

<210> 22 
<211> 7127 
<212> DNA 
20 <213> Artificial Sequence 

<220> 

<223> Description, of Artificial Sequence :DNA plasmid 
25 <400> 22 

agcttgagta ttctatagtg tcacctaaat agcttggcgt aatcatggtc atagctgttt 
60 

cctgtgtgaa attgttatcc gctcacaatt ccacacaaca tacgagccgg aagcataaag 
120 

30 tgtaaagcct ggggtgccta atgagtgagc taactcacat taattgcgtt gcgctcactg 
180 

cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg 

- -240 - - - - - --- - - -- - ----- - _ _ . _ 

gggagaggcg gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc 
35 300 

tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc 
360 

acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg 
420 

40 aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat 
480 

cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag 
540 

gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga 
45 600 

tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg 
660 

tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt 
720 

50 cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac 
780 

gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc 
840 

ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt 
55 900 

ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc 
960 

ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc 
1020 

60 agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg 
1080 
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aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag 
1140 

atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg 
1200 

5 tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt 
1260 

tcatccatag ttgcctgact ccccgtcgtg tagataacta gacccacgct caccggctcc 
1320 

agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac 
10 1380 

tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc 
1440 

agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc 
1500 

15 gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc 
1560 

catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt 
1620 

ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc 
20 1680 

atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg 
1740 

tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag 
1800 

25 cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat 
1860 

cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc 
1920 

atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa 
30 1980 

aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta 
2040 

- ttgaagcatt- tatcagggtt attgtctcat gagcggatac atatt.tgaat g.tatttagaa . __ _ . 

2100 

35 aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 
2160 

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtct 
2220 

cgcgcgtttc ggtgatgacg gtgaaaacct ctgacacatg cagctcccgg agacggtcac 
40 2280 

agcttgtctg taagcggatg ccgggagcag acaagcccgt cagggcgcgt cagcgggtgt 
2340 

tggcgggtgt cggggctggc ttaactatgc ggcatcagag cagattgtac tgagagtgca 
2400 

45 ccatatgcgg tgtgaaatac cgcacagatg cgtaaggaga aaataccgca tcaggcgaaa 
2460 

ttgtaaacgt taatattttg ttaaaattcg cgttaaatat ttgttaaatc agctcatttt 
2520 

ttaaccaata ggccgaaatc ggcaaaatcc cttataaatc aaaagaatag accgagatag 
50 2580 

ggttgagtgt tgttccagtt tggaacaaga gtccactatt aaagaacgtg gactccaacg 
2640 

tcaaagggcg aaaaaccgtc tatcagggcg atggcccact acgtgaacca tcacccaaat 
2700 

55 caagtttttt gcggtcgagg tgccgtaaag ctctaaatcg gaaccctaaa gggagccccc 
2760 

gatttagagc ttgacgggga aagccggcga acgtggcgag aaaggaaggg aagaaagcga 
2820 

aaggagcggg cgctagggcg ctggcaagtg tagcggtcac gctgcgcgta accaccacac 
60 2 88 0 

ccgccgcgct taatgcgccg ctacagggcg cgtccattcg ccattcaggc tgcgcaactg 
2940 



ttgggaaggg cgatcggtgc gggcctcttc 
3000 

tgctgcaagg cgattaagtt gggtaacgcc 
3060 

5 gacggccagt gaattgtaat acgactcact 
3120 

acttttaaag ttctgctatg tggcgcggta 
3180 

ctcggtcgcc gcatacacta ttctcagaat 
10 3240 

tggatggtat aaacggaaac aaaaaaaaga 
3300 

ttatttgatt ttatttaata gtatatatta 
3360 

15 ttgctgtagt gccattgatt cgtaacacta 
3420 

agtatccaaa aaaacggcta tttttttgca 
3480 

ataatgaaag aaaaaatctt tttttttgtt 
20 3540 

acattgatca attcctgagc aacaacccca 
3600 

cagttgaaga aagaaataga aatagaaata 
3660 

25 gacctatagt gagagagcag aaactcatgc 
3720 

aatggaactg aagaaaacca atttatgtgc 
3780 

ccttgaatta attgataaat taggtcctta 
30 3840 

aatcaatgat ttttcctatg aatccactat 
3900 

-- - - tcaatttatg- atttttgaag. atagaaaatt 
3960 

35 atatattggt ggagtttata aaattagtag 
4020 

cactgggaat ggagtggttg aaggattaaa 
4080 

agagccaaga gggttattga tgttagctga 
40 4140 

agaatattct caaaaaactg ttgaaattgc 
4200 

tattgcccaa cgtgatatgg gtggccaaga 
4260 

45 tggagttgga ttagatgata aaggtgatgg 
4320 

agttgttagc actggaactg atattatcat 
4380 

agatccagat attgaaggta aaaggtatag 
50 4440 

gactggccaa ttataaatgt gaagggggag 
4500 

agaataaata aataaataag ttaaataaat 
4560 

55 tacaatcaaa ggtggtcctt ctagctgtaa 
4620 

gtaaaaatgg aatcaataaa gccctgcgca 
4680 

atgaccagca cagtcgtgat ggcaaggtca 
60 4740 

cagtgaggga agatctgata ttgacgaaga 
4800 
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gctattacgc cagctggcga aagggggatg 
agggttttcc cagtcacgac gttgtaaaac 
atagggcgaa ttggttttcc aatgatgagc 
ttatcccgtg ttgacgccgg gcaagagcaa 
gacttggttg agtactaata ggaattgatt 
gctggtacta ctttctttaa aattatttta 
tattttgaac gtagattatt ttgttgaaag 
attctgtatt agtcattcct cttgtttgat 
atcttatttc ctgcatatta tacagataac 
cttcaatgat gatttcaacc attcttttaa 
tacacactgg tttatatacc gcccctttta 
gcaaacaaaa gatatgacag tcaacactaa 
ctcaccagta gcacagcgat tatttcgatt 
atcaattgac gttgatacca ctaaggaatt 
tgtatgctta atcaagactc atattgatat 
tgaaccatta ttagaacttt cacgtaaaca 
tgctgatatt ggtaataccg . taaagaaaca _ __ 
ttgggcagat attaccaatg ctcatggtgt 
acagggagct aaagaaacca ccaccaacca 
attatcatca gtgggatcat tagcatatgg 
taaatccgat aaggaatttg ttattggatt 
agaaggattt gattggctta ttatgacacc 
attaggacaa caatatagaa ctgttgatga 
tgttggtaga ggattgtttg gtaaaggaag 
aaatgctggt tggaatgctt atttgaaaaa 
attttcactt tattagattt gtatatatgt 
aattaaataa gggtggtaat tattactatt 
tccgggcagc gcaacggaac attcatcagt 
gcgcgcaggg tcagcctgaa tacgcgttta 
gaatagccca agtcggccga ggggcctgta 
ggaaccaatg taacgttaca ctgaagaaaa 



cacataataa acgggaagaa acggtgtaaa 
4860 

ccttggttta attccaaacg aaacgtgtat 
4920 

5 gtctagattg tttgtttact ccagactgtg 
4980 

atatttttag gcttagctct aaatataaga 
5040 

agtttcaaaa attggtaatg tgaggtatta 
10 5100 

ttgatacatt tcattttgaa aataatgaaa 
5160 

taaagtaatt atgggaatta gaagcgaaca 
5220 

15 aagtgggaac acttttttct ccattgtttt 
5280 

gctatgtgta aaaaaatatt gccaagaaaa 
5340 

aaattttggg gaagcttcgg attaatttat 
20 5400 

aaaaaaaatt taagcagttc ataaaacctt 
5460 

ttttcccgac accaaaatca taattaacta 
5520 

25 tttttatttg aagctgaaaa gcgatttatg 
5580 

ttattttttt tgtcgggcaa agaaaaactg 
5640 

tttgtgtctg gagaattcat tcctctctca 
30 5700 

attcaaaata gttcggtttc cggggttggt 
5760 

- -aagaatgttt tagctcattg -gttttctttc 
5820 

35 acttgttatt acaatcatat aaaattaaac 
5880 

cagatccttt ttgatttctt tgtaattttt 
5940 

gatttttata agaaagagtt ataccctgca 
40 6000 

atgcggccgc taggcctcga gggacttttg 
6060 

atttaaataa ataaaaataa ctcataattt 
6120 

45 ttcatatgca gtacattagc catcagtcac 
6180 

tgaagcaatt gtccagtccc agaggcacag 
6240 

tgtaagtctg ttaatgaagt aaaagttcct 
50 6300 

acagtgactg tactccttgg ccttcaggta 
6360 

gtgcagactg ctcatgagtt ttcccctggt 
6420 

55 gactgtcttc agatggttta tctgatgata 
6480 

ctcattccag ccagtgctag atgaatcttg 
6540 

ctcatagatg gtcaatgcgg cgtcctcctt 
60 6600 

agggatgtca aagttcatcc tgtccttgag 
6660 
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agtgtgaaaa taatttttga atatcatttc 
ttttttagag aatgggaatt cttattggat 
cacaaaaacg tttggatgga tgatcagaag 
aatgatgctt gaaaatccag acagaaattg 
gtcaactaac caaataacaa tgcaaaccgg 
ctggaattgg atgaccagca cacaaacaca 
tagaggaata ctttgccacg aacagaatac 
agttctgttt ttttgtcaaa ctggttttgt 
aaagcttgtt ttgtggccag tgtccgaaaa 
ttttttattc catcggggaa agtggggggg 
ccaaaaaata tatggacaga gatgattgta 
tgagaaaatt gaatgtaacg ttacaattta 
atttttccga aatgaaaatt ttttttaggt 
aacaaggatt attaaaattt ttggtgtttg 
tcttcacaca atgtttagac atctgacacg 
gtttagtttt cgtttttcgt tttttttgga 

-ttcattcaat- agttttgaaa gaat.ttgccc 

tttgatataa aatagagttt gaaagtttcc 
ttttctccca catatacaca catacaaacc 
gctcgacctc gagggatccg ggccctctag 
caccaaaaat aatttatttt ccaaaataaa 
aataaaaatt tcaaaatctt ctagtgtcct 
ttaaacagca tctgctggtt gaagaatgct 
gctaggagat cttcagtttc ggaggtaacc 
taggatttcc actctgacta tggtccaggc 
atgcagaatc ctcccataat atcttttcag 
gaaatcttct ttctccagtt tttcttccag 
gacattagcc aggaggttct caacaatagt 
tctgaaaata gcaaagatgt tctggagcat 
ctggaactgc tgcagctgct taatctcctc 
gcagtattca agcctcccat tcaattgcca 
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caggagcttc tgacactgaa aattgctgct tctttgtagg aatccaagca agttgtagct 
6720 

catggaaaga gctgtagtgg agaagcacaa caggagagca atttggagga gacacttgtt 
6780 

5 ggtcatgttc ctcgaggcct ttttggccag ctggcgcctg ctgcgcgacg gcgagctgct 
6840 

caccacccag gatccgtccc ccttttcctt tgtcgatatc atgtaattag ttatgtcacg 
6900 

cttacattca cgccctcccc ccacatccgc tctaaccgaa aaggaaggag ttagacaacc 
10 6960 

tgaagtctag gtccctattt atttttttat agttatgtta gtattaagaa cgttatttat 
7020 

atttcaaatt tttctttttt ttctgtacag acgcgtgtac gcatgtaaca ttatactgaa 
7080 

15 aaccttgctt gagaaggttt tgggacgctc gaaggcttta atttgca 
7127 
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